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Quantum ChromoDynamics 



Michael H. Seymour 

University of Manchester, UK, and CERN, Geneva, Switzerland 
Abstract 

These lectures on QCD stress the theoretical elements that underlie a wide 
range of phenomenological studies, particularly gauge invariance, renormal- 
ization, factorization and infrared safety. The three parts cover the basics of 
QCD, QCD at tree level and higher order corrections. 

1 Basics of QCD 
1.1 Introduction 

QCD is the theory of the strong nuclear force, one of the four fundamental forces of nature. It describes 
the interactions of quarks, via their colour quantum numbers. It is an unbroken gauge theory. The gauge 
bosons are gluons. It has a similar structure to QED, but with one important difference: the gauge group 
is non-Abelian, SU(3), and hence the gluons are self-interacting. This results in a negative /^-function 
and hence asymptotic freedom at high energies and strong interactions at low energies. 

These strong interactions are confining: only colour-singlet states can propagate over macroscopic 
distances. The only stable colour singlets are quark-antiquark pairs, mesons, and three-quark states, 
baryons. In high energy reactions, like deep inelastic scattering, the quark and gluon constituents of 
hadrons act as quasi-free particles, partons. Such reactions can be factorized into the convolution of 
non-perturbative functions that describe the distribution of partons in the hadron, which cannot be cal- 
culated from first principles (at present) but are universal (process-independent), with process-dependent 
functions, which can be calculated as perturbative expansions in the coupling constant a s . 

Beyond leading order in a s , the parton distribution functions and coefficient functions become 
intermixed. They can still be factorized, but the parton distribution functions become energy-dependent. 
Although the input distributions at some fixed energy scale still cannot be calculated, the energy depen- 
dence is given by perturbative evolution equations. 

In sufficiently inclusive cross sections, called infrared safe, the non-perturbative distributions can- 
cel and distributions can directly be calculated in perturbation theory. Non-perturbative corrections are 
then suppressed by powers of the high energy scale. The most important examples are jet cross sections, 
where jets of hadrons have a direct connection to the perturbatively-calculable quarks and gluons. 

This course will attempt to give a brief overview of the subject. The approach will be pretty 
phenomenological, with most results stated rather than derived. I will however attempt to sketch in most 
cases roughly how they would be derived. One thing I will not have time to go into in much detail will 
be heavy quarks: in most cases we will treat the d, u, s, c and b quarks as massless and neglect the top 
quark, an approximation that I will motivate in Section IT~9l 

It is hard to give a better introduction to the subject than the book 'QCD and Collider Physics', by 
Keith Ellis, James Stirling and Bryan Webber] 1 1. So I will follow the ESW approach and notation pretty 
closely. In most cases they will be able to give you a few more details and references to much more 
detailed treatments if you want to go further. For a much more detailed treatment of the formulation of 
QCD and its renormalization in particular Peskin and Schroeder[2| is also unbeatable. 

As there are many parallels with QED I will have to assume prior knowledge of the basics of QED 
and that you can calculate a few simple cross sections. However we start by recapping a few features. 



1.2 Basics of QED 

QED is a gauge theory with gauge group U(l). It can be derived using the gauge principle. The classical 
Lagrangian density for n types of non-interacting fermion is 

n 

Cierm = ^2fi(ift-mi)fi, (1-1) 

i 

where is a spinor- valued wave function describing plane waves of momentum pi, fi its Dirac conjugate 
fj 7 , $ is shorthand for j^a^ and 7^ are Dirac spinor matrices with anticommutation relation 

{7", 7"} = 2sT. (1.2) 

The Lagrangian density (11.11) is invariant under global changes of gauge, 

fi-> fi= exp(iej6»)/i, (1.3) 

where is an arbitrary flavour-dependent parameter, which will turn out to be proportional to electric 
charge. We can derive QED by asking how we would need to modify (11.11) to make it also invariant under 
local changes of gauge, 

fi(x) -> fi(x) = exp(ieiO(x))fi(x). (1.4) 

This can be done by introducing a new vector- valued field A fl , which transforms under the same change 
of gauge like 

A^x) -» A'^x) = A^x) + ^(a M exp(i0(x))) exp(-i0(a:)), (1.5) 
and by replacing the derivative by the covariant derivative, 

D fl = d tl + ieQA ll , (1.6) 
where Q is the charge operator, defined by 

Qfi = e l f i . (1.7) 
Since A^ is a new field that we have introduced, we must make it physical by adding a kinetic 

term, 

£kin = -\F^F^, (1.8) 

where the field strength tensor F^ v is defined by 

= d^A u - d v A^. (1.9) 
The classical QED Lagrangian density is therefore given by 

1 n 

^classical = ~^F^ + £ f&p - Kit) ft. (1.10) 

i 

This is now invariant under local changes of gauge. 

Perturbative calculations are made according to the Feynman rules. These can be read off from 
the action, defined by 

S = i [ d A xC. (1.11) 



bilinear term in A^: 



There is however one complication. The photon propagator A 7i/W/ (p) is derived from the inverse of the 

A 7 , M „(p) x i\p 2 g™ - pVl = 6? (1.12) 
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This does not have an inverse. However, we can exploit the gauge invariance of the theory to rewrite 
it in a physically equivalent form that is invertible. Since the Lagrangian density is gauge invariant, we 
can choose some convenient gauge to work in and the final answer should be independent of which we 
chose. For example, in the covariant gauge, we have the condition 

&"A^ = (1.13) 
at every space-time point. We can therefore add an extra term to the Lagrangian density 

1 2 

£gaugc— fixing = 7J/7^ {d^A^) , (1.14) 

where A is an arbitrary parameter, and provided we work in a covariant gauge we cannot have changed the 
physics, since we have only added zero. (This is essentially just the method of undetermined Lagrange 
multipliers for minimizing an action subject to a constraint: the constraint is d!.13t and the multiplier is 
1/2 A). The final results must clearly be independent of A, although it will appear at intermediate steps of 
calculations. Common choices are A = 1 (Feynman gauge) and A — > (Landau gauge). For arbitrary A, 
we must now solve 

A 7 ,^(p) x i\p 2 g v ° - (1 - \)p v p°] = (1.15) 



A' 



which yields 



i 



A 7 ^ = ^^ + (1-A)»j. (1.16) 

Clearly the Feynman gauge offers significant calculational advantages, so we use it for most of the rest 
of this course. 

Another popular class of gauges are the axial (or physical) gauges, defined in terms of an arbitrary 
vector n, by 

1 2 

^gaugc— fixing = ( n ^A^) . (1.17) 

These have the result that an on-shell photon has two polarization states, which, in the (n+p) rest-frame, 
are purely transverse to its direction. The penalty is that the propagator becomes more complicated, 

a if . n^pv+p^riv (n 2 + Xp 2 )Pfj,pA , 110N 

&-y,lw — — —9iw H 7 V2 • U-lo; 

p \ n-p \n-pY J 

Obviously some simplification is obtained by setting n 2 = and A — ► (the "lightcone" gauge), but 
practical calculations are still considerably more complicated than in covariant gauges. In particular, if 
making a numerical calculation, it is difficult to guarantee that the spurious singularities n-p — > cancel 
as they should. 

We therefore have the Feynman rules (in Feynman gauge): 

p — rrii p z — m\ 

To calculate the cross section for a given process, we must write down all possible diagrams, 
use the Feynman rules to give us the amplitude iM., use Dirac algebra and trace theorems to calcu- 
late \ Ai\ 2 , where the sum is over all unobserved quantum numbers for example spin, divide by the 
overcounting of incoming states, and integrate over phase space: 



1 1 

a= sYs 



dr^|M| 2 . (1.22) 
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An element of ?7,-body phase space is given by 



n / ,4 \ 



n(^|kV 2 ^ 4 (^-^^. (i.24) 



For example, the cross section for e + e — ► is calculated as follows. The amplitude is 



iM = v{p e +){ie)^u(p e -) i— ^ u(p /1 -)(ie)~f u v(p tl+ ) (1.25) 



(Pe+ +Pe-) 2 

and hence 



{Pe+ +Pe-Y 
ze 2 

v{p e +)l^u{p e -) u(p jU -)7 At n(p M +) (1.26) 



J2\M\ 2 = ^^Try e+ ^ e -r} Tr{^- 7 ^ +7 ,}, (1.27) 



where a = e /Att and s = (p e + + p e ~) , or 

s 



^ I 1 2 = 16 ( 4 ™) (^+Pe- + Pe~Pe+ ~ Pe+ 'VtrST) (P^Pfi+u + P^P^u ~ P M + ^-9^) 

(1.28) 

J) 2 ^, (1.29) 



where i = (p e - — p„- ) 2 and ii = (p e - — ) 2 = — s — 1 The cross section is therefore 



l J_ r° dt_ QtA „^2 t2 + u 2 

4 2s /_. 8tts 



°- = 7 - / — 8(47ra) 2 ^^ (1.30) 



47ra 2 
3s 



(1.31) 



1.3 SU(3) and colour 

QCD can be derived in exactly the same way as QED: we start from the Lagrangian density for a set 
of non-interacting quarks and modify it in just such a way that it is invariant under changes of gauge. 
The only difference is that instead of the gauge transformation being a simple phase (U(l) group), we 
consider a non-Abelian group SU(A C ). This has several important consequences. Fermion charges will 
come in N c different types, called colours, they will be quantized (in contrast to the electric charges e$, 
which could take any value) and, most importantly, the gauge bosons will be self-interacting. 

It has been well-known since the early days of QCD that there are three colours, for example 
from baryon wave functions, the total e + e~ cross section (which is proportional to N c ) and 7r° decay 
rate (which is proportional to A 2 ). However, in most calculations it is useful to keep the number of 
colours A c arbitrary until the very last step when it is set equal to three. The A c -dependent coefficients 
are a useful diagnostic tool in understanding the physical origins of different terms, comparing different 
calculations and tracking down errors. 

We start by recapping some features of SU(A), the group of A x A unitary matrices (WU = 1) 
with determinant +1. Let U be an element of SU(A r ) that is infinitesimally close to the identity and write 
it as 

U = l + iG, (1.32) 
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where G has infinitesimal entries. It must be hermitian (G* = G) and traceless. One can choose a basis 
set of iV 2 — 1 matrices, t , A = 1, . . . , N 2 — 1, such that any G can be written as 



N*-l 



g= eAfA > (° 3 ) 

A 

where are infinitesimal numbers. Note that I will always denote colour indices that run from 1 to N 
by a and from 1 to N 2 — 1 by A. The t A are called the generators of the group and define its fundamental 
representation. You can show that is antihermitian and traceless and hence can be written as a 

linear combination of other £ s, 

[t A ,t B ]=if ABC t c , (1.34) 

where f ABC are a set of real constants, called the structure constants of the group. It is straightfor- 
ward to see that f ABC is antisymmetric in A, B, and with a little more work, one can prove that it is 



antisymmetric in all its indices. Equation (11.34b defines the Lie algebra of the group. 

We can also define a set of (iV 2 — 1) x (iV 2 — 1) matrices that obey the same algebra: 

(T A ) Bc -if ABC , (1.35) 

[T A ,T B ]=if ABC T c . (1.36) 

These define the group's adjoint representation. 

Although we started with elements infmitesimally close to the identity matrix, we can calculate an 
arbitrary element U by stringing together an infinite number of infinitesimal elements, 

U = lim (1 + i6 A t A /N) N = exp(i9 A t A ) = exp(it-0). (1.37) 

TV^oo 

Since U is unitary and t A hermitian, we have 

U~ l =exp(-it-0). (1.38) 

There are several identities we will require time and time again: 

Tr(t A t B ) = \5 AB = T R 5 AB (1.39) 
N 2 - I 

= -^S^^CpSac (1.40) 



A 



Tt(T c T d ) = Y J f ABC f ABD = N5CD ^ C A^ CD , (1-41) 
A,B 

where the constants Cf and Ca are the Casimir operators of the fundamental and adjoint representations 
of the group respectively. Although we know the numerical values of these constants: 

T R = i (1.42) 
Cf = |, (1-43) 
C A = 3, (1.44) 

it is good practice, as I said, to leave them unexpanded in all algebraic results. 

In fact for practical calculations one only requires these, and other similar, identities and never an 
explicit representation for t A or f ABC . 
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1.4 The QCD Lagrangian 

The classical Lagrangian density for n non-interacting quarks with masses mi is 

n 

^quarks = E ~ m i)abQii (1-45) 

i 

where the factor {i$ — mi) a b is proportional to the identity matrix in colour space. This is invariant under 
global SU(iV c ) transformations, 

Qa^ q' a = exp(it-6) ab q b . (1.46) 
To make it invariant under local transformations, 

q a (x) -> q' a {x) = exp(it-6(x)) ab q b (x), (1.47) 

we have to introduce the covariant derivative, 

D^ab = d fl l ab + ig s (t-A tl ) ab , (1.48) 

where are coloured vector fields that transform in just the right way that we have 

D'^ ab q' b {x) = exv(it-Q{x))abD^ bc q c {x), (1.49) 

giving 

i 

9s 

We again have to introduce a kinetic term for this new field, 



t-A'p = exp(it-6(x))t-An exp(-it-6(x)) + — (d M exp(zM9(x))) exp(-it-9(x)). (1.50) 



Cun = -\F*F%', (1.51) 

where F^ u is the non-Abelian field strength tensor. However, the definition we used in QED ( II .91 does 
not result in an invariant Lagrangian density under transformation (11.50b . One must add an extra term, 

= - d»A$ - g s f ABC A*AC, (1.52) 



and only then is dl.51l > invariant under gauge transformations. 

This extra term has profound consequences for the theory: it means that gluons are self-interacting, 
through three- and four-point vertices. This will turn out to give rise to asymptotic freedom at high 
energies and strong interactions at low energies, among the most fundamental properties of QCD. We 
therefore see that these are absolute requirements of the S\J(N C ) gauge symmetry. 

Before reading off the Feynman rules we again have to fix the gauge. This proceeds in exactly the 
same way as in QED, leading to, in covariant gauges, 

-^-gauge-fixing = ~~ (d^A^ ^ . (1.53) 

Finally, it turns out that in a non-Abelian gauge theory, it is necessary to add one extra term 
to the Lagrangian density, related to the need for ghost particles. These are beyond the scope of this 
course, but basically they arise because when a non-Abelian gauge theory is renormalized it is possible 
for unphysical degrees of freedom to propagate freely. These are cancelled off by introducing into the 
theory an unphysical set of fields, the ghosts, which are scalars but have Fermi statistics. For practical 
puiposes it is enough to know that there exist Feynman rules for ghosts and that in every diagram with a 
closed loop of internal gluons, we must add a diagram with them replaced by ghosts. It is worth noting 
that in physical gauges, as the name suggests, ghost contributions always vanish and they can be ignored. 

The final Lagrangian is therefore 

1 ^ 1 2 

£qcd = -^iT + E tiW ~ m iUi ~ 2X (P^ A f) + A*ost. (1-54) 

i 
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1.5 Feynman rules 

Just as in QED it is straightforward to read off the Feynman rules from the action. We obtain in Feynman 
gauge (only the gluon propagator is gauge dependent) 



Af 


= 6 ab * 
p - m 


A AB 


= s AB C 9 !? u 
pi 


911 


= -ig s t A r 


1 999 


= -9sf ABC \ 



5 ab i 



p + rrii 

pZ _ m *. 



(p-q) X g^ + {q-ry g ^ + {r-pYg 



(1.55) 

(1.56) 

(1.57) 
(1.58) 



Note that, apart from the triple-gluon vertex, the only difference relative to QED is in the colour struc- 
ture: propagators are diagonal in colour and the vertex for a gluon of colour A to scatter a quark of 
colour b to a quark of colour c contains (t" 4 )^. Note also that unlike QED the quark-gluon vertex is 
flavour-independent (it is straightforward to check that, unlike in QED, we cannot introduce a flavour- 
dependence into the gauge transformation, Eq. (I1.47t and retain gauge invariance). In the triple-gluon 
vertex, the three gluons have momenta p, q, r, Lorentz indices fi, v, A and colour indices A, B, C respec- 
tively. The momenta are all ingoing: p + q + r = 0. 

The Feynman rules for ghosts and for the four-gluon vertex can be found in ESW[ 1 1 (p. 10). They 
will not be needed for this course. 

Note also that in analogy with QED the strong charge g s is usually substituted by a s , 



_ 9s 



(1-59) 



1.6 e+e — ► qq 

One of the most fundamental quantities in QCD is the total e + e~ annihilation cross section to hadrons. 
We will see in a later lecture that to leading order in a s this is equal to the total e + e~ — > qq cross 
section. The calculation is very similar to that for e + e~ — > fJ- + ^~, the only difference being in the 
colour structure. The photon is colour blind, so the Feynman rule for a photon to couple to a quark 
contains a trivial colour matrix, S ab . Summing over colours and dividing by the number of incoming 
colour states (1 in this case since electrons are not coloured), we therefore obtain 

a(e + e~ -► qq) = a(e + e~ -> fi + ^ ) x e\ x ^ 5 ab 5 ba . (1.60) 



We obtain 



and hence 



S ab 5 ba = J2$ aa = N c , (1.61) 

a,b a 

„ a(e + e~ — ► hadrons) ^ , ir _ _ 

^had = i t z r = Y) e 2 q N c . (1.62) 

a(e+e — > ) V 



1.7 e+e — > qqg 

This process will be important for the higher order corrections to a(e + e~ — > hadrons) and particularly 
for the study of three-jet final states in e + e~ annihilation, among the most important test-beds for QCD. 

There are two Feynman diagrams, shown in Fig. ll.ll We label the momenta and colours e~(p_) + 
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Fig. 1.1: Feynman diagrams for the process e + e — > qqg. 

e + (p + ) — > q a (pi) + Qb(P2) + 9a(P3)- For the matrix element we obtain 

iM = v{p + )(ie)'fu{p-) ef (1.63) 

s 

u a [Pi )< {-ig s )t ab l -, ■ 72 {-iee q 7 +{-iee q )-f ■ ^{-WsFabl \ v b(P2)- 

I (P1+P3) [P2+P3r J 

We will evaluate the cross section from this matrix element later. Here we are interested in the colour 
algebra. Using the fact that the spin sum of a massless vector particle is proportional to the colour identity 
matrix, 

J2^B = ~9 flV SAB, (1-64) 

spin 

we obtain 



£ \M\ 2 oc £ ti (tl)* = J2 = E Tr (^ A ) = = C F N C , (1.65) 

a,b,A a,b,A A 

where the first step uses the fact that t A are hermitian, the second is simply a trivial rewrite, switching to 
matrix notation, the third uses Eq. dl.40t and the fourth uses the fact that the matrix being traced is the 
identity matrix of the fundamental representation, i.e. the N c x N c identity matrix. Note that since the 
colour factor of the lowest order process is N c , we can associate Cf with the emission of the additional 
gluon. Since the emission probability of a gluon from a quark is proportional to Cf, and we will later 
see that that from a gluon is proportional to Ca, Cf and Ca are sometimes referred to as the squares of 
the colour charges of the quark and gluon respectively. 

Performing the trace Dirac algebra on the matrix element, we finally obtain 

lQC F N c e^ q9 l , + + + , 

SP1-P3P2-P3 V ' 

(Note the misprint in ESW[ 1 ] - their result is a factor of 4 too large). 



1.8 The coupling constant a s and renormalization 

As we mentioned above, in practical calculations, a s is usually used rather than g s . Besides the quark 
masses, which we will neglect in most of this course, g s is the only parameter in the QCD Lagrangian 
and therefore assumes a central role in our study of QCD. However, it is not a priori clear that parameters 
in the Lagrangian are physically observable quantities - any physical observable can be calculated as a 
function of them (at least in perturbation theory) and their values can be extracted from measured values 
of physical observables, but they are not necessarily themselves physical. It is worthwhile therefore 
to consider whether we can reformulate our theory in such a way that one physical observable can be 
written as a function of another. This reformulation is known as renormalization. 
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(a) (b) (c) (d) 

Fig. 1.2: When a quark-gluon vertex (a) is corrected by a loop (b), one must decided whether to describe it as a correction to 
the vertex (c), or to the rest of the diagram (d). 

In this section I give a very handwaving description of renormalization, which I believe conveys the 
important physical point. Of course for practical calculations one needs a much more precise definition 
of the renormalization prescription, which I describe at the end. 

We redefine g s to be the strength of the quark-gluon coupling, as in Fig. ll.2h . At first sight, this 
seems like a trivial statement and at the lowest order of perturbation theory it is - the two definitions 
are identical. However, when we calculate higher orders of perturbation theory, we encounter loop 
corrections like the one in Fig. 11.2b . which correct the vertex. To avoid double-counting, we must 
uniquely decide whether these corrections are part of the vertex, as in Fig. ll.2b . or the rest of the diagram, 
as in Fig. I1.2H . One way to decide is to introduce a renormalization scale /ar and say that physics at 
high scales (therefore short distances) above lir is part of the vertex and physics at lower scales (longer 
distances) below \i r is part of the rest of the diagram. Of course, this is simply a book-keeping device, 
which does not changed the physics, it simply ensures that each physical contribution to the process is 
counted once and only once. Since \xr is a completely arbitrary book-keeping scale, introduced by hand, 
its value should not affect the physical prediction - changing it simply moves contributions between what 
we call the vertex and what we call the rest of the diagram. Since the amount of physics that we include 
in the vertex depends on \ir, and we defined g s to be the strength of the vertex, it is clear that g s must 
now be a function of fiR. 

It is worth mentioning that, although I defined g s as the strength of the quark-gluon vertex, I could 
equally well have defined it as the strength of the triple-gluon vertex. It is one of the remarkable features 
of gauge theories that, as a direct result of the gauge symmetry, I would get exactly the same result for the 
renormalized coupling g s ([iR). That is, the equality of the strengths of the quark-gluon and triple-gluon 
vertices is true even after renormalization. 

When it is clear that I am talking about the renormalization scale, I will henceforth drop the 
subscript r. 

1.8.1 Renormalization Group Equation 

As I said, varying \i moves physical contributions (loop corrections) around within a calculation, but it 
should not change the result of the physical calculation. We can use this fact to derive an equation for 
how g s varies as a function of \i. This is one of a set of equations that together describe how the whole 
theory varies with renormalization scale (and scheme), which formally form a group. 

We study this by considering a dimensionless physical observable R that is a function of only one 
physical scale Q 2 (think of i?had at y/s = Q for example). Assume that this observable is not sensitive 
to quark masses (we will return to this point shortly). After renormalization, R can only be a function of 
Q 2 , ji 2 and a s (fi 2 ). By dimensional analysis, the only way the dimensionless function R can depend on 
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the dimensionful variables Q 2 and ji 2 is through their ratio. We can therefore write 



R = R(Q 2 /fj 2 ,a s ( fJ 2 )). 



(1.67) 



We can use the fact that R, as a physical quantity, must be independent of the value of \x, and the chain 
rule for partial derivatives, to write 



fi 2 -^R(Q 2 /fi 2 ,a s ) 











dp 



,da R d 



d/j, 2 da s 
2 d n . , d 



da s 



R 
R. 



(1.68) 
(1.69) 



i.e. /3 (as) = /-^f^f- There are several points to note about this. 



A physical solution is provided by R(l, a s (Q)), i.e. by setting the renormalization scale equal to 
the physical scale in the problem. 

Q-dependence of the physical quantity R comes about only because of the renormalization of the 
theory and would not be present in the classical theory. Thus measuring the Q-dependence of R 
directly probes the quantum structure of the theory. 

By reananging Eq. (II .69b . one can derive the fi 2 dependence of a s from a calculation of R, 



P(a s 



^0 
dR 



(1.70) 



• If a s is small, R is perturbatively calculable and hence {3(a s ) is too. 
The j3 function of QCD is now known to four-loop accuracy, 

(3(a s ) = -a|(A) + ^i«s + (32a 2 + fea 2 + . . 



(1.71) 



Although the higher orders are essential for quantitative calculation, they are not for qualitative under- 
standing: almost all QCD phenomenology can be understood using the one loop result, 



11C A - 4T R N f 



12vr 



(1.72) 



where Nf is the number of quark flavours that can appear in loops, to be discussed further shortly. 

Note that @q is positive and hence that the function is negative, at least when a s is small. 
This results in asymptotic freedom: the fact that the interactions become weak at high energies (short 
distances) and infrared slavery: the fact that they become strong at low energy. 

If we neglect the higher orders, we can solve the renormalization group equation exactly, to obtain 
a s at some scale Q as a function of its value at the renormalization scale ji, 



a s (Q 2 



l + a s (n 2 )p \n^' 



(1.73) 



1.8.2 Choosing fi 2 

Although physical quantities do not depend on fi, a calculation truncated at a finite order of perturbation 
theory does. We must therefore choose some value for fi. To illustrate this, suppose that our dimension- 
less physical quantity R has a perturbative expansion that starts at 0(a s ), 

R = R ias + ..., (1.74) 
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Fig. 1.3: A measurement of a s at any scale Q fixes which curve our universe lies on, but to compare measurements at different 
scales we have to agree to label the curves in a standard way, for example using a s (M z ). 

then if we truncate at leading order, 



The leading order result in renormalized perturbation theory is the first term of this series, i.e. 7?ia s (/Lt 2 ). 
It is therefore clear that although \i is completely arbitrary, choosing it far from Q guarantees a large 
truncation error (note that the converse is not true). One should therefore choose fi 2 'close' to Q 2 , but 
how close is close? 

The conventional approach is to set /x = Q and to use the /i variation in a reasonable range, 
e.g. Q/2 to 2Q as an estimate of the truncation uncertainty. It should be clear from the foregoing 
discussion that this is an extremely arbitrary procedure. However, the folklore is that in almost all 
cases where higher order corrections have been calculated, they have fallen within the band given by this 
procedure. 

1.8.3 Measuring a s 

The j3 function tells us how a s varies with scale, but it does not tell us the value of a s at any particular 
scale: we need an experimental measurement to do that. Effectively j3{a s ) defines a family of curves, 
as illustrated in Fig. 11.31 and one measurement at any scale is sufficient to tell us which curve our 
universe lies on. However, in order to compare and combine measurements of a s at different scales, 
we have to agree on some convenient labeling of the curves. The measurement at any given scale can 
then be converted into a measurement of the label. Historically, this was often done using the 'QCD 
scale', Aqcd, described in the next section, but more recently it has been realized that the value of 
a s at some fixed scale at which it is relatively small is a lot more convenient. Since some of the best 
measurements come from Z° decays, it has become universal to use a s (M z ) as the label. We will discuss 
the measurements of a s further in Section l3.1.4l 



R ~ RiOis i 

our truncated expression for R(l, a s (Q)) can be expanded as a power series in a s (/x 2 ) 
R(l,a s (Q 2 )) « Ria s (Q 2 ) 

= Rxasifj 2 ) 1 - /3 a s (> 2 )ln% a? (fj 2 ) In 2 + 




(1.75) 



(1.76) 



(1.77) 
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1.8.4 The 'QCD Scale', A 

As I just mentioned, this is another way to label the running coupling, which is to construct a renormal- 
ization group invariant scale from a s (fi). Although the Lagrangian of massless QCD has no scale, the 
renormalization process introduces a dimensionful parameter, 



where the approximation uses only the one-loop term in the (3 function. This process by which a scaleless 
theory gets a physically observable scale by the introduction of the unphysical renormalization scale is 
known as dimensional transmutation. 

At leading order, A has a simple interpretation, it is the scale at which the coupling becomes 
infinite. However, this interpretation is not self-consistent, since it relies on a truncation of the pertur- 
bation series in a region in which the coupling is large, ultimately divergent. More generally, A can be 
viewed as a renormalization group invariant parameterization of the scale at which the theory becomes 
non-perturbative. All non-perturbative quantities, for example the hadron masses, would be expected to 
be of order A. 

However, A is not a very practically useful label for the value of a s . This is because its precise 
value, for a given measured value of a s , depends strongly on the theoretical input used in the calculation, 
for example which order of perturbation theory we truncate f3 at, which renormalization scheme we use, 
the number of flavours we assume, the way we match the running coupling at the flavour thresholds, etc. 

In principle any labeling suffers these problems, but by using the value of a s in a region where 
it is small, and where the scale is not too different to that at which the measurements are made, the 
impact on a s (M z ) is small, whereas A is related to the region where a s is large, far away from where 
the measurements are made, and these effects are large. 

1.8.5 Renormalization in practice 

To give a simple physical picture of renormalization, I have described it in terms of a cutoff on the scale 
of the physical effects that are included in different components of a Feynman diagram calculation. How- 
ever, in practice, this definition is extremely unattractive, because it breaks Lorentz and gauge invariance, 
two of the fundamental symmetries of our theory. If calculating in this scheme, these symmetries will 
get violated by a truncation at any finite order of perturbation theory and only restored in an all orders 
calculation. There are other simple schemes that work well in certain cases, for example the Pauli-Villars 
regularization, but the only known scheme consistent with all the symmetries of QCD, and hence guar- 
anteed to work at any order of perturbation theory, is dimensional regularization. In this section I give a 
very brief description of how this works in practice. The difference between fi and /i R will be (slightly) 
relevant here, so I temporarily reinstate the subscript. 

The basic observation is that the loop corrections that we have been discussing are divergent in 
four or more space-time dimensions, but are finite in less than four dimensions. We therefore choose 
to calculate our Feynman diagrams in d < 4 dimensions (we always work in Minkowski space, with 
one time dimension and d— 1 space dimensions). With a little thought, we can analytically continue 
the number of dimensions to be a complex number such that at the end of the calculation, after the 
renormalization prescription has been followed, we can let it smoothly tend back to 4 and obtain finite 
results. We therefore define d = 4 — 2e and consider the e — > 0+ limit. 

By counting the dimensionality of terms in the Lagrangian, we discover that the coupling constant 
becomes dimensionful in d / 4 dimensions. This is not very convenient, so we define a dimensionless 
parameter a s , by introducing a completely arbitrary scale fi, 




(1.78) 



(1.79) 
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where a s is the dimensionful d-dimensional coupling. Note that jjl should not be confused with the 
renormalization scale //r (but often is) because we have not yet renormalized the theory. 

When calculating loop corrections, we then find terms that have 1/e singularities for small e. 
These have the right form to be absorbed by a redefinition (i.e. a renormalization) of the coupling. Since 
we also want the renormalized coupling to be dimensionless, we have to introduce a dimensionful scale 
at which the renormalization is performed, //r. To make this concrete, at one-loop order, the prescription 
is straightforward: after calculating all the one-loop diagrams, rewrite all occunences of a s in terms of 

asUm) =a s + /3 F(e) (^\ - a 2 s . (1.80) 

Provided F(0) = 1, once this substitution has been made, the amplitude is finite. That is, the e poles 
that this expression produces exactly cancel those from the one-loop calculation. Moreover, the arbitrary 
scale \i cancels from the amplitude at this point. One is left with a finite amplitude that depends only on 
fiR and a s (/iji), in exactly the same way as discussed earlier. 

The arbitrary function F(e) = 1 + 0(e) defines the renormalization scheme. More precisely, it 
defines what finite parts of the loop amplitude are subtracted into the renormalized coupling, in addition 
to the divergent part. The MS, or minimal subtraction, scheme, is defined by subtracting nothing else, 

^Ms(e) = 1- (1-81) 

The most commonly used scheme is the MS, or modified minimal subtraction, scheme, in which one 
identifies some additional overall factors coming from the analytical continuation of the angular inte- 
grations in the one-loop calculation. Since they are universal it is convenient to subtract them into the 
coupling, 

F ms( £ ) = fjr^eT = 1 + 0*4* - 7E)e, (1.82) 

where T is the Euler gamma function and 7^ the Euler gamma constant, 7^ « 0.577216. Note that the 
two expressions on the right-hand side of Eq. (11.821) differ at order e 2 . Different practitioners use either 
of the two definitions and, although the difference is irrelevant for one-loop calculations as it vanishes as 
e — > 0, it does make a finite difference to two-loop calculations. As more and more two-loop calculations 
are being made an agreement on the precise definition of the MS scheme is needed. 



1.9 Quark masses and decoupling 

The quark masses m q are also parameters of the Lagrangian and face the same issues: for a physical 
calculation we should redefine them in a physical way. For the electron mass, we have a simple definition: 
we can isolate a single electron and 'weigh' it in the lab. That is, we can define its mass through the 
classical limit. We cannot use the same procedure for quarks, because confinement means that we can 
never take a single quark off to our lab to weigh it individually. We must therefore define some other 
renormalization procedure. 

It is possible to proceed in close analogy with the coupling strength. We renormalize our the- 
ory at the same scale \x. We encounter gluon loop corrections to the quark propagator and absorb 
the part of them at scales above \i into the definition of the mass. We therefore obtain a 'running' 
(i.e. scale-dependent) mass. Just like for the coupling, we can obtain a renormalization group equation 
with perturbatively-calculable coefficients. At leading order it can be solved exactly, to give 

1 

mifj 2 ) = M a s (fj, 2 ) %0 ° , (1.83) 

where M is a renormalization group invariant constant (c.f. Aqcd)- Note that increasing [i 2 decreases 
m 2 . Thus quarks appear to get lighter as they are probed at scales further and further above their masses. 
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An alternative scheme, which is often used in electroweak physics, and in the physics of heavy 
mesons, is the pole mass. Here one defines m q to be the pole of the propagator i{p" + m q )/(p 2 — m 2 q ) 
to all orders. This is very useful for Q ~ m q , but it turns out that it is similar to a running mass scheme 
with fi of order m q and hence generates large logarithms and a large truncation error for Q 3> m q . 

If our dimensionless observable R is finite for massless quarks then the quark mass effects must 
vanish smoothly as the mass goes to zero. Therefore the mass effects must be suppressed by (m q /Q) n , 
with n > 1. If there are quarks with mass much greater than Q, they can only affect our observable 
through loop corrections. A dimensional argument shows that such corrections must be suppressed by 
(Q/m q ) n , with?i > 2. 

These observations form the basis of the decoupling theorem, in which quarks heavier than our 
physical scale can be ignored, and quarks lighter than it can be treated as massless. Thus, for most QCD 
calculations, we work with Nf flavours of massless quark (recall the Nf that appeared in (5$). Care must 
be taken when Q is close to a quark mass, or we study a range of processes at scales that span a quark 
mass, but in fact for most of the phenomenology considered in this course we can simply take Nf to be 
fixed, N f = 5. 

1.10 Summary 

We have seen that QCD is a gauge theory. The fact that the gauge symmetry is non-Abelian predicts that 
the gluon is self-interacting. This leads to the fact that the theory becomes strongly interacting at low 
energies, and hence non-perturbative, and weakly interacting at high energies so that perturbation theory 
can be used. 

The main tools that we will use to study QCD are the factorization of non-perturbative effects and 
the renormalization and decoupling of high-energy physics. These allow us to use perturbation theory 
and, in particular, the Feynman rules, to study the phenomenology of QCD. 

2 QCD Phenomenology at Tree Level 

Leading order perturbation theory, together with the one-loop renormalization group equation is enough 
to understand a wide variety of QCD phenomenology. In this section, we briefly review the phenomenol- 
ogy of QCD before introducing the complications of loop corrections to it in the following section. Most 
of the salient ideas are introduced in the context of e + e~ annihilation and deep inelastic scattering, but 
apply equally well to hadron collisions and photoproduction, which we discuss more briefly at the end. 

2.1 The cross section for e+e — — ► hadrons 

One of the most striking features of e + e~ annihilation events is the fact that many of them produce many 
hadrons. In trying to calculate the cross section for this process however, we are immediately faced with 
a problem: the Lagrangian does not contain any information about hadrons, so there are no Feynman 
rules involving them. Even if there were, calculating all the diagrams for events involving thirty or forty 
particles would be prohibitively complicated, let alone integrating them over the corresponding phase 
space to produce a total cross section. Fortunately a simple application of the Feynman rules of QED, 
together with some simple symmetry arguments, allows us to make a surprisingly strong statement about 
the cross section for e + e~ annihilation to hadrons. 

We postulate that the matrix element for the sum of all diagrams in which a virtual photon with 
Lorentz index v and momentum q produces a particular set of n hadrons with momenta {pi . . .p n } is 
known and parameterize it by a function T u (n, q, {p± . . .p n })- Using this function, it is straightforward 
to write down the matrix element for the full process, 

M = {v(q 2 )e-f fl u(q 1 )} — — T v (n, q, {pi . . .p n }) (2.1) 
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and hence the phase-space integral for its total cross section. The total cross section to produce any 
number of any type of hadrons is then simply given by the sum of this integral over hadron type and 
multiplicity (both generically represented by Xm)> 



1 1 e 2 



2s 4 s 

x 



aTHAVriT") (2-2) 



/ dPS n T fl (n,q,{p 1 ...p n })T*(n,q,{p 1 ...p n }). (2.3) 

n ■* 

We then define a new two-index tensor, H^, to represent this sum of integrals, 

H A<l) = E / dPS - T m T »> (2-4) 

n •* 

which after the integration and summation can only be a function of q 2A . Now, there are only two 
possible Lorentz covariant two-index tensor functions of one four-vector, g^ u and q^qy. We therefore pa- 
rameterize H^y as a linear combination of these, with coefficients that are functions of the only available 
Lorentz scalar, q 2 , 

= A{q 2 )g^ + B(q 2 ) qijL q y . (2.5) 

Finally, since the theory is gauge invariant (in practice boiling down to invariance under the change 
— > e M + q^ for the polarization vector of a photon of momentum q), must be perpendicular to 
both q^ and q u , 

qt+H^ = q v H^ = 0, (2.6) 
giving a relation between the two functions, 

A = -q 2 B. (2.7) 

The final step is to realize that B(s) has to be dimensionless. Since it is a function of only one dimen- 
sionful parameter, it must therefore be constant. We therefore have the fundamental prediction that (for 
energies well above all hadron masses) the cross section to produce any number of hadrons is propor- 
tional to that to produce a muon-antimuon pair, 

, , a(e + e~ — > hadrons) 

R(e + e~) = -Vi 1 r- = constant, (2.8) 

a(e + e~ — > ^ + ^) 

without knowing anything about the interactions of hadrons ! 

In order to go further than this and try to predict this constant, or learn something from its mea- 
surement, we need a specific model of the production of hadrons. This is provided by the quark parton 
model. Of course this can be more rigorously derived, but I find it more useful to illustrate the physics 
with a space-time argument, see Fig. 12. II Since the photon is highly virtual, it is produced and decays to 
quarks in a small space-time volume, t ~ On the other hand, the wavefunction of a hadron with 

mass ~ rrihad has spatial extent ~ 1 /m^ and hence the confinement of a quark pair into the hadron 
takes t ~ 1 /rrihad- Thus there is no time for the confinement to affect the annihilation cross section and 
we expect 

a(e + e~ — > hadrons) cr(e + e~ — > quarks), (2.9) 
and the Feynman rules do tell us how to calculate that. 

21 Can you spot the flaw in this argument? It assumes that all information about the hadron momenta is washed out by the 
integration, which is only true if they are massless. In general since p\ is fixed at m\ during the integration, H also depends 
in a complicated way on the masses of all possible hadrons. In fact we will shortly justify, on the basis of a space-time picture, 
neglecting these, in the limit that q 2 is much greater than all m|. It also ignores any other masses in the problem, like the Z 
mass, which we remedy later on. 
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Fig. 2.1: Space time sketch of the production of a hadron in e e annihilation. 



In fact, we can go further than that and use an argument from quantum mechanics to postulate the 
form of the corrections to this approximation. Over a region of size ~ 1/ \Js, the amount by which the 
wave function of a hadron with spatial extent ~ l/7Ti na d, could vary is ~ m na d/ V s an( ^ tne corrections 
should be at least this to some positive power, 

a(e + e~ -> hadrons) = a(e + e~ -> quarks) x (l + O (^7^) ) • ( 2 -!0) 

On the basis of the space-time picture, we can only justify that the corrections to the quark parton model 
are suppressed by some (positive) power of the ratio of scales. In practice, n is believed to be 6 for e + e~ 
annihilation, making these corrections so small as to be almost impossible to measure. For most cross 
sections however, n is 2, and for jet cross sections, 1 . 

We calculated the cross section for e + e~ — ► qq in Section [l~6l and obtained 

_ cr(hadrons) , T 9 ^,„s 
R e +e- = — ( ^ = N c J2e 2 q , (2.11) 

where the sum over q is over all quark flavours that are kinematically allowed, i.e. for which y/s > 2m q . 
If we ignore effects close to threshold, such as the formation of bound states, we can expect a plot of 
R e + e - against sjs to present a series of steps at twice the quark masses and be flat in between. In 
principle one can read off the quark masses and charges from this plot. 

Looking at the data in Fig. 12.21 we see that the general trend is as expected, but there are clearly 
corrections that are not accounted for by the quark parton model. One of these is the effect of higher 
order QCD corrections, which we include in the next lecture. Another is the effect of the Z° boson, 
which is clearly seen at the high energy end of Fig. 12.21 which we include shortly. 

Before including the Z° contribution, it is worth remarking on a historical ambiguity that affects 
this figure. Although people wrote 

_ o"(e + e~ — ► hadrons) 
cr(e+e~ — ► /U+/i _ ) 

they often didn't actually use that formula to show their experimental results, but rather 

a(e + e~ — > hadrons) 



R 



4na 2 



3s 

using the leading order QED result for the denominator. Clearly many of the experimental and theoretical 
systematic errors would be smaller if the former was used, although of course the statistical errors would 
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Fig. 2.2: Data on R e + e - as a function of centre of mass energy, taken from ESW| 1|. The horizontal lines show the quark 
parton model expectations. 



be larger, by around a factor of 2. More recent measurements, for example from LEP, have used the more 
honest notation in which the numerator and denominator are calculated or measured in the same way. 
This is sometimes called Rhad to differentiate it from R. 

In Fig. 12.31 1 show the calculation of Rhad m the quark parton model, including the Z° contribution. 
It is clear that 7-Z interference is important, even far from the Z peak. However, exactly on the peak the 
interference is zero (you might like to think about a simple explanation for why) and Rhad is given to a 
good approximation by the Z contribution alone, 

J2a v a + a l 

Rhad = Nc \ 3 9 = 20.095, (2.12) 

where Vi and ctj are the vector and axial couplings of the Z° to fermion type i. I note for future reference 
that the value including the photon contribution is 19.984. This number compares well with the LEP 
average measured value of 20.775 ± 0.027. However, the difference is still large on the scale of the 
experimental uncertainty, again indicating a clear need for the QCD corrections. 



2.1.1 t decays 

We conclude this section by mentioning the closely-related process of r decay to hadrons, depicted in 
Fig.l2l4l One can apply exactly the same arguments to the blob in this diagram as to annihilation of 
e + e~ to hadrons. The only differences are that we have a virtual W boson producing hadrons instead 
of a virtual photon, and that we have an integral over all virtualities of the W between the r mass and 
zero, rather than a single virtuality fixed by the beam energies. Nevertheless exactly the same arguments 
follow through and one obtains 

^ ^ g(r -, hadrons) =jV 
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Fig. 2.3: Calculation of Rhad as a function of centre of mass energy. 




Fig. 2.4: Decay of r lepton to hadrons. 

where the sum is over the flavours of quark and antiquark that can appear in the W decay and V is the 
CKM matrix. Since a r _ can decay to ud or us, to a good approximation this sum is cos 2 9c + sin 2 9c 
and the final result follows. 

We will see later that this process provides an excellent measurement of a s . 
2.2 Deep Inelastic Scattering 

Historically, the quark model developed as a way of rationalizing the vast array of strongly-interacting 
particles that had been found by the sixties. However, it was not clear whether quarks were really 
physical constituents of hadrons, or merely a convenient mathematical language to describe the hadrons' 
wave functions. The decisive evidence came from deep inelastic scattering experiments at SLAC. Today, 
deep inelastic scattering experiments give us by far the best information about the internal structure of 
the proton. 



2.2.1 Quarks as partons in hadronic scattering 

The classic probe of nuclear structure is electron-nucleus scattering. Assuming the scattering takes place 
by exchanging a single photon, measuring the kinematics of the scattered electron uniquely constrains 
that of the photon. The scattered electron has two non-trivial kinematic variables, its energy and scat- 
tering angle. These can more conveniently be converted into the photon virtuality (Q 2 = —q-q) and 
energy in the nucleus rest frame v. Q 2 controls the resolving power of the photon, Q 2 ~ 1/A 2 . For 
fixed small Q 2 -C l/R 2 , where R is the nuclear radius, the photon is absorbed elastically by the nucleus, 
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Fig. 2.5: Deep inelastic scattering. 

giving a narrow peak in the v distribution at v = Q 2 /2Mn. For increased Q 2 ~ 1/R 2 one begins to 
resolve nuclear resonances as additional peaks at higher v. Finally, for large Q 2 S> 1/R 2 , one resolves 
the proton constituents of the nucleus, with the photon being absorbed elastically by individual protons. 
These show up as a peak at v = Q 2 /2M p , broadened by the internal motion of the proton within the 
nucleus. 

The scattering of electrons off hadrons, protons for example, is exactly analogous: at low Q 2 one 
sees only elastic proton scattering, but as Q 2 is increased, the photon can be elastically absorbed by the 
(charged) quark constituents of the proton. (Eventually at very large Q 2 and v something new happens 
relative to the nuclear case, but we will not discuss that until the next lecture). 

We are interested in the region of Deep (Q 2 S> M 2 ) Inelastic (W 2 S> M 2 , where W is the 
invariant mass of the photon-proton system) Scattering, DIS. We are therefore justified in neglecting the 
proton mass throughout, provided we do not work in the proton rest-frame, which is not well defined in 
that case. This is most conveniently done by working in terms of Lorentz-invariant variables. 



2.2.2 Lorentz Invariant Variables 

It is convenient to describe this in terms of Lorentz-invariant variables. We label the momenta as shown 
in Fig. 12.51 For an electron of momentum k to scatter to one of momentum k! by exchanging a photon of 
momentum q with a proton of momentum p we again have, for fixed centre-of-mass energy s, only two 
independent kinematic variables, 



s = (k + p) 2 , (2.14) 
I 2 , 

Q 2 



Q 2 = -q 2 , (2.15) 



Zp-q 

in terms of which we can calculate two other commonly-used variables 

1 — x 



The kinematic limits are 



(2.16) 



W 2 = {p + qf = Q 2 ^, 
p-k xs 



(2.18) 



Q 2 < s, (2.19) 
Q 2 

x>—. (2.20) 

s 

The coverage of the (x,Q 2 ) plane by the HERA, and earlier fixed target, DIS experiments is shown in 
Fig. I2~6l 
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Fig. 2.6: The x-Q 2 plane, showing the coverage of measurements by various experiments. 
2.2.3 Structure functions 

Since we do not yet know anything about the internal structure of protons, we cannot calculate the matrix 
element for the interaction of a photon with the proton to produce some arbitrary state X. However, like 
in the case of e + e~ to hadrons we can get a surprisingly long way just by considering the properties that 
that matrix element must satisfy. 

We parameterize the matrix element for a proton of momentum p to absorb a photon of momentum 
q and Lorentz index \i to produce an arbitrary set of hadrons X with fixed momenta {px} as 

eT^{p,q\{px})- (2.21) 
We therefore have the matrix element squared for the whole process 

\\M\ 2 = I^IV{^Y} T„(p,q;{p x })T:(p, q ;{p x }). (2.22) 
For convenience we define the Lorentz tensor 

L^ = Tr{#7^V}- (2-23) 

If the state X consists of n hadrons, then the n+l-body phase space for the whole process can be 
factorized into a part describing the electron kinematics times the n-body phase space for X, 

dPS = „ dQ 2 dx dPS X - (2.24) 
16tt z sx z 
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This is as far as we can go for a specific state X, but we can get further by integrating over the phase 
space of X and summing over all possible states X. We define 

^ / dPS x \\M\ 2 ^^- A ZTH^ (2.25) 

or 

]T / dPS x T M (p, q; { Px }) r;(p, «; {px}) = fl^. (2.26) 

Since we have summed and integrated out all dependence on X, H fll/ can only depend on the vectors p 
and q. Since the electromagnetic and strong interactions conserve parity, it must be symmetric in fj, and 
v. There are only four possible symmetric two-index tensors that can be constructed from two vectors, 
so we can parameterize the hadronic tensor as a linear combination of them: 

TT _ IT „ , TT "PllVv , TT Q^Qv , IT V \Av j~ QflPv n 97 N 

tl^v — —ttl9fiv + H 2~Q2~ + H ^~Q2~ + 5 ' U- 2/ ) 

where the £/s are scalar functions of the only two Lorentz scalars available q-q = — Q 2 and p-q = Q 2 /2x, 
i.e. of x and Q 2 only (not s). (Note that we neglect p-p = M 2 since we work in the limit \q-q\ , p-q 3> p-p). 

If we include Z° exchange (or charged current scattering) we can construct one further tensor, 
which is antisymmetric in \i and v, H3 e /11/ \ (7 p x q cr , where e^vXa is the totally antisymmetric Lorentz 
tensor. 

Contracting with L^ v we find that Ha, and H^ cannot contribute to physical cross sections (think 
about a simple explanation why not) and we have 



t p-k p-k' 

~Q 2 



L^H^ = 4k-k' Hi + A T' Z' H 2 ■ (2-28) 



Redefining (just a matter of convention) Hi = iirFi and H2 = 8nxF2, we obtain the final result for the 
scattering cross section 

= % V* F ^ + p - ■ < 2 - 29 > 

Without knowing anything about the interactions of hadrons, we have been able to derive the s depen- 
dence of the scattering cross section for fixed x and Q 2 (which enters through the y dependence: recall 

y = Q 2 /xs). 

The Fs are called the structure functions of the proton. It is common to see other linear combina- 
tions of the structure functions, 

F T (x,Q 2 ) = 2xFi(x,Q 2 ), (2.30) 
F L (x,Q 2 ) = F 2 (x,Q 2 )-2xFi(x,Q 2 ), (2.31) 

which correspond to scattering of transverse and longitudinally polarized photons respectively. We there- 
fore have 

i^w = 5f [ (1 + (1 " y ^ FT ( x > Q ^ + w - vW x > q2 ^\ • (2 - 32) 

In fact the most common form you will see this in nowadays is 

d 2 a 2na 2 



dx dQ 2 xQ 4 



(l + (l-y) 2 )F 2 (x,Q 2 )-y 2 F L (x,Q 2 ) . (2.33) 



For the majority of current data, y 2 is small and Fl can be neglected: only close to the kinematic limit, 
or for very precise data, need it be considered. 
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Fig. 2.7: In the Breit frame, the proton of diameter 2R is contracted to a pancake of thickness 4RxM p /Q (a) so that a photon 
of high virtuality Q interacts incoherently with a single parton within it (b). 

We have isolated all the non-trivial x and Q 2 dependence into the two functions ^(x, Q 2 ) and 
F L {x,Q 2 ), but we still have no idea how those functions behave. If we make the assumption that the 
interaction of the photon with the innards of the proton does not involve any dimensionful scale, then we 
immediately get the result that the dimensionless Fs cannot depend on the dimensionful Q 2 and we get 



d 2 a 



2ttcx z 



dx dQ 2 xQ 4 



(l + (l-y) 2 )F 2 (x)-y 2 F L (x) 



(2.34) 



known as Bjorken scaling. Experimentally this is true to a pretty good approximation, but given that 
the proton is supposed to consist of quarks, bound together with a distance scale ~ l/M p , how can the 
interaction possibly be Af p -independent? The answer to this lies in the parton model. 



2.2.4 Parton distribution functions and Bjorken scaling 

Although it is of course Lorentz-invariant, the parton model is most easily formulated in a frame in which 
the the proton is fast moving. Most convenient is the so-called Breit frame, in which the photon has zero 
energy and collides head-on with the proton. In this frame, the proton energy is Q/2x. Assuming that in 
its own restframe it is a sphere of radius R, in the Breit frame it is massively Lorentz contracted to a flat 
pancake, still with transverse diameter 2R, but with length ARxM p /Q <C 2R, as illustrated in Fig. l2.7h . 
The transverse size of the photon is ~ 1/Q <C 2R. The photon therefore interacts with a tiny fraction of 
a thin disk, so provided that the quarks are sufficiently dilute the photon is not able to resolve the quarks' 
interactions and they act as if they were free. That is, the photon effectively collides with a single free 
quark, as illustrated in Fig. l2.7b . 

Since they act as if they do not interact, their interactions do not introduce a dimensionful scale, 
and so the structure functions will obey Bjorken scaling. 

More precisely, we suppose that the proton consists of a bundle of comoving partons, which cany 
a range of the proton's momentum. We posit probability distribution functions (more often called parton 
distribution functions, pdfs), such that partons of type q carry a fraction of the proton's momentum 
between 7] and -q + d-q a fraction f q (rj) drj of the time. Provided that these partons are pointlike r 2 <C 1/Q 2 
and dilute f q {i]) <C Q 2 R 2 , the photons will scatter incoherently off individual partons. The cross section 
can then be factorized as the convolution of the pdfs with the cross section for parton scattering, 

We will calculate the partonic cross section shortly, but first let me point out a couple of features it must 
have. 

Firstly if we assume that the scattering is elastic, then the outgoing parton must be on mass-shell. 
Since we are then considering a two-to-two collision, which has only one nontrivial kinematic variable, 
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the double-differential cross section in x and Q 2 must be proportional to a 5 function fixing one of the 
variables. Specifically, if we assume that the partons are massless, then we obtain the relation 



(q + VP) 2 = 2 VP -q-Q 2 = 0, (2.36) 

or 

rj = x. (2.37) 

Secondly if we assume that the struck partons are the quarks of the quark model, they must be 
fermions. Simply from helicity conservation, we can then show that Fl = 0. This is known as the 
Callan-Gross relation and was one of the first proofs that the quarks of the quark model really were the 
partons of the partem model. (If the partons were instead scalars we would have Ft = and hence 
completely different y-dependence of the cross section). 

2.2.5 Scattering cross sections 

To calculate the partem model prediction for the structure functions, we need the matrix elements for 
eq — > eq. These can be obtained by crossing symmetry from those for e + e~ — ► qq. That is, 

El A ^ 1 2 o(a \2 2 W (Pe-Pq) 2 + (Pe-P q ) 2 
\M\ = 8(4™) e q N c (p e .j/)2 (2 " 38) 

Converting to the kinematic variables we defined earlier, we have 

Y,\M\ 2 = 8(4vra) 2 e 2 q N c 1 + ^ V? . (2.39) 



Using (12.24b . we have 



O 2 

dPS = ^ dQ 2 dx dPS x ■ (2.40) 
1Qtt z sx z 



Since X consists only of one massless parton, we have 

dPS x = ^-5( P 2 x )(27r) 4 5 4 ( VP + q- Px ) (2.41) 



(2ir)5(( VP + qf) (2.42) 
^S( V -x). (2.43) 



The full cross section is therefore 



w = khww 5{x ~^ M]2 i2M) 

S(x-r,)J2\M\ 2 , (2.45) 



4iV c 16vrQ 4 

where the factor of 1/N C is the average over incoming colours. We therefore have 
and hence 

da(e+p) 2na 2 



dx dQ 2 xQ 4 



(l + (l-y) 2 ^e 2 q xf q (x). (2.47) 
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Fig. 2.8: The structure function F2 as a function of x for various Q 2 values, exhibiting Bjorken scaling, taken from ESW| 1 1. 



Comparing (12.471 with (12.331) we therefore have 

F 2 (x,Q 2 ) = ^e^/,(x), (2.48) 
q 

F L {x,Q 2 ) = 0. (2.49) 

Note that F 2 is Q 2 -independent, showing Bjorken scaling. 

Although we will see that QCD corrections do violate Bjorken scaling, it is satisfied pretty well 
by the data, as can be seen in Fig. 12.81 

2.2.6 Charged Current Neutrino DIS 

We can consider charged current neutrino scattering in exactly the same way. Since the scattering takes 
place by the weak interaction, parity is violated, allowing one additional Lorentz structure, 



T 5 



re _i_ o», hPk' a 



H l9 ^ + H 2 



uupa 



Q 2 Q 2 



Y2rr , ra 1 - y tj , Q 



^L^H^ = 2Q z H 1 + Q 2 ^-H 2 ±^H 3 {l-y/2). 

Thus, defining H3 = 8ttxF^, we have a third structure function F3: 

2 r 

(l + (1 - y) 2 ) F£ - y 2 F[ ± (l - (1 - y) 2 ) xF£ 



d 2 a(l+p) _ G\ 



M 2 



dxdQ 2 4vrx \Q 2 + M% / 
where Gf is the Fermi constant and M w the W boson mass. In the partem model we have 

F{(x,Q 2 ) = ^2x/ 9 (x)+^2x/ 9 -(x), 

1 Q 

xFi(x,Q 2 ) = -X>/ ? (z), 



(2.50) 
(2.51) 

(2.52) 

, (2.53) 

(2.54) 
(2.55) 



where the sums for neutrino scattering are over all partons that can absorb a W + , i.e. d, s, u and c and 
for antineutrino over those that can absorb a W~, i.e. u, c, d and s. 
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Fig. 2.9: Partem distribution function set A from the Martin-Roberts-Stirling group, taken from ESW| 1 1. 
2.2.7 Global fits 

It is also possible to measure DIS on the neutron, or at least on deuterium from which the neutron 



structure functions can be derived. Using strong isospin symmetry, we have the relations 

fu/n(x) = f d /p(x), (2.56) 

fu/n(x) = fd/p(%), (2-57) 

fd/n(x) = f u /p(x), (2.58) 

f s /n(x) = f s / p (x), (2.59) 

and so on. It is conventional to always refer to the proton case, dropping the "/p" subscript. We therefore 

have the slightly confusing result for F| n shown below, in which fd is multiplied by (2/3) 2 , and so on. 

We therefore have 

-^2^ = gXfd + gXfu + gXfj + gXfu + gXf S + gXfg + gXf C + gXfc, (2.60) 

Fy™ = g x fd + \xf u + + ^xfu + \xf s + ^xfg + |x/ c + gXfc, (2-61) 

F V 2 V = 2xf d + 2xfu + 2xf s + 2xf 5 , (2.62) 

xFl v = 2xf d -2xfu + 2xf s -2xf 5 , (2.63) 

Ff = 2xf u + 2xf d - + 2xf c + 2xf- s , (2.64) 

xFl v = 2xf u -2xf d - + 2xf c -2xf- s . (2.65) 



If we make the assumption that f s = f s and / g = f c , then we have six unknowns for six pieces of data 
so, given precise enough data, we could solve for all the pdfs exactly. In practice of course it is never so 
simple and one must make global fits to as wide a variety of data as possible. 

One gets typical results like those shown in figure l2~9l Note that this uses the common notation of 
defining valence quark distributions, 

fuv = fu~ fa, (2.66) 
/*, = fd-fd- (2-67) 
Non-valence quarks are generically referred to as the sea. 
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2.2.8 Sum rules 

Having results for the pdfs, one can form interesting integrals over them, for example 

/' 

Jo 



dxf Uv (x) = 2, (2.68) 



o 



dxf dv {x) = 1. (2.69) 



Various such integrals can be constructed directly from the structure functions. It is worth checking that 
you can reproduce the physical interpretation of each. 

The Gross-Llewellyn-Smith sum rule 

\ £dx(F» p + F£ p )=3, (2.70) 

which counts the number of valence quarks in the proton. In QCD this provides a useful measurement 
of a s , because the right-hand side is actually equal to 3 (l — ^a. _|_ O(o^)). 

The Adler sum rule 

i f 1 ± ( F *P _ F »P\ = i, (2.71) 

J X ^ 

which counts the difference between the number of up and down valence quarks. This has the property 
that it is exact even in QCD, i.e. all higher order corrections vanish. 

The Gottfried sum rule 

£ (i? e p _ p en^ _ q 33 (2 72) 

where the result is experimental. This is sensitive to the difference between the number of up and down 
sea quarks: it would be 1/3 if they were equal. 

The momentum sum rule Finally, we have the particularly significant result 

'f" v + F* p ) « 0.5, (2.73) 



where the result is again experimental. This tells us that only about half of the proton's momentum is 
earned by quarks and antiquarks. 

2.3 Hadronic collisions 

2.3.1 The Drell-Yan process 

If the parton model is correct, the partem distribution functions should be universal. We should therefore 
be able to use the DIS measurements to make predictions for other hadronic scattering processes. The 
classic example is the so-called Drell-Yan process, of lepton pair production in hadron collisions, 

hi + h 2 -> fi + + yT + X, (2.1 '4) 

where the state X goes unmeasured. In the parton model this arises as the sum over all quark types of 

q + q^fi + + y~. (2.75) 
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The cross section can be written as the convolution of pdfs with a partonic cross section, exactly like in 
DIS: 

da(hi( Pl ) + h 2 {p 2 ) -> fi + fi~) f 1 j t t \ f 1 a t i ^ da (<l(ViPl) +q(il2P2) -> 

(2.76) 

where M is the mass of the pair. Note that since the partonic cross section contains a S(M 2 — 

771772s) term, binning the data in M gives extra information about the pdfs. In fact, binning also in the 
rapidity of the lepton pair, denned by 

y=hn f + »- +P *>^- , (2.77) 

both ?] values are fixed, providing a direct measurement of the partem distribution functions (the partonic 
cross section can easily be obtained by crossing the e + e~ — > qq one we calculated in Section [l~6l divided 
by a factor of iV c 2 for the average over incoming colours): 

^ = ^E^uw/.^^w.). (2.78) 

Note that the case h\ = h 2 = p provides a particularly good measure of the sea quark distribution 
functions, which are hard to extract from DIS data. 



2.3.2 Prompt photon and jet production 

Although we have not yet mentioned gluons, we will see in the next lecture that there is also a non-zero 
pdf for the gluon, f g (j]), as can also be inferred from the momentum sum rule mentioned earlier. As well 
as being important for higher order corrections to the processes given above, there are many processes in 
which they participate at tree level. The most important of these are prompt photon production, 



and jet production 



h + h 2 -> 7 + X, (2.79) 



hx+h 2 -> q + q + X, (2.80) 

h 1 +h 2 -> q + q + X, (2.81) 

hi + h 2 -> q + g + X, (2.82) 

h 1 +h 2 -> g + g + X, etc. (2.83) 



The gluon pdf is used in exactly the same way as the quark ones, and hadronic cross sections 
can still be calculated as the sum of convolutions of pdfs with partonic cross sections. Prompt photon 
production receives contributions from two partonic processes, 

q + q 7 + 5, (2-84) 

q + 9 1 + q- (2-85) 

In the case h\ = h 2 = p, the latter dominates, providing a measure of the gluon pdf. However there is a 
slight complication, in that processes (12. 84b . (I2.85t are proportional to a s , which is less well-known than 
a, which controls the other processes we have studied. In fact this is always the case, that measurements 
of the gluon pdf actually measure a s x f g in general. The QCD corrections to this process turn out to be 
a lot larger than any of the others we have considered, further complicating this measurement. 
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2.4 Summary 

We have considered the tree-level phenomenology of e + e~ annihilation, deep inelastic scattering and, 
more briefly, hadron collisions. It is remarkable how much QCD phenomenology can be understood 
using tree level results. However, we have to worry that a s is not so small, so higher order corrections 
must be important. Equally importantly, it would be nice to see whether, and if so how, the parton model 
emerges from QCD. 

We discuss both these issues in the next lecture. 



3 Higher Order Corrections 

3.1 e+e — annihilation at one loop 

In this section, I go through the calculation of the NLO correction to the e + e~ — > hadrons cross section 
in some detail. I will briefly describe some of the more technical aspects of the calculation, for those 
interested, in Section l3.1.2l but those who are not can safely skip this section, since I recap the important 
results at the start of Section l3.1.3l 

In discussing the e + e~ — ► hadrons cross section at tree level, we assumed that quarks produce 
hadrons with probability 1. Therefore we calculated the e + e~ — > qq cross section in Section [l~6l In 
discussing jet cross sections, we extended this to say that all partons produce hadrons with probability 1. 
Therefore we should calculate the total cross section to produce any number or type of partons. At 
leading order this makes no difference, since the only possible process is e + e~ — > qq, but at order a s we 
have to calculate and sum the cross sections for qq and qqg final states. We start with the latter. 

Recall that the total qqg cross section is divergent, 

Ot f x^ ~\~ x^ 

a = a Cf- 1 / dxi dx 2 - 1 Wl 2 r, (3.1) 

2ir J (1 — xi){l - x 2 ) 

where the region of integration is the upper right triangle of the unit square, bordered by the lines x± = 1 
and X2 = 1, which are the singular regions. This divergence must be regularized in some way, before we 
can make progress. 

First though we discuss the origin of the divergences. They arise from propagator factors that 
diverge, 

e, 

11 1 1 

(3.2) 




(p + k) 2 2p-k 2Eu(l-cos9) Ea>9 2 ' 

where E and to are the quark and gluon energies and 9 is the angle between them. 

In the collinear limit, 9 — > 0, one in principle obtains 1 / 9 4 in the matrix element squared, but in 
fact the numerators always contribute a factor of 9 2 , so one obtains 

\M\ 2 ~ 1 (3.3) 

In the soft limit, uj — ► 0, one has in the interference between diagrams in which the gluon is attached to 
quark 1 and quark 2, 

\ M f ~ ~ -1 (3.4) 

p\-kp2-k uj z 

In terms of lo and 9 the phase space is given by 

d?k 

= h iudui sin 9 d9 deb ~ coduj 9d9. (3.5) 

2uj A 

We therefore have logarithmic singularities in both the soft and collinear limits. We generically refer to 
both of these as the infrared limit. 
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3. 1. 1 Regularization 

As in the discussion of renormalization, the simplest way we could regularize this cross section is with 
a cutoff, for example on the transverse momentum of the gluon, which would prevent the integration 
entering both the soft and collinear regions. However, we will see that infrared singularities cancel 
between different contributions, in this case qq and qqg, so we must use a regularization that can be 
consistently applied in all contributions. It is not clear that this is the case for a cutoff, since it must be 
applied in both real and virtual contributions, which have very different structures. Instead, to ensure 
consistent application across all processes, it is better to modify the theory in such a way that some 
dimensionless parameter e regulates the divergences. Then the complete calculation can be performed in 
this modified theory and at the end of the calculation, when all the divergences have cancelled, the limit 
e — > can be smoothly taken. Remarkably, dimensional regularization, which we used for ultraviolet 
singularities, also provides a consistent regulator for infrared singularities, as we shall discuss in detail 
shortly. 

Another regularization scheme, which actually works well in QED, and for simple processes in 
QCD, is the gluon (or photon) mass regularization. We introduce a non-zero gluon mass m 2 g = eQ 2 . 
This prevents the propagators from reaching zero and diverging: for massless quarks the minimum value 
is m g and for a quark of mass m q it is 2m q m g . With this modification one can recalculate the differential 
cross section and integrate it to give a finite result, 

CTqqg = ^ ^ (bg 2 1 - 3 log ^ + 7 - y + 0(e)J . (3.6) 

However, since a non-zero gluon mass violates gauge invariance, this method is bound to fail in general. 
In particular, it is not suitable for any process in which any lowest order contributions have external 
gluons. As in the ultraviolet case, the only scheme that is known to be consistent with all the symmetries 
of QCD, and hence to work to arbitrary orders in arbitrary processes is dimensional regularization. 

The reason why I said that it is remarkable that dimensional regularization works in the infrared 
limit is the fact that the two limits have non-overlapping regions of applicability in the complex d plane. 
Ultraviolet-singular integrals are regularized by working in d < 4 dimensions, but infrared-singular inte- 
grals are only rendered finite by working in d > 4 dimensions. However, by carefully splitting contribu- 
tions that are singular in both the infrared and ultraviolet one can consider the regularization schemes that 
are used in each as independent. In each region, one considers the appropriate dimensionality (d = 4— 2e 
with e > in the ultraviolet and with e < in the infrared) and then analytically continues to the whole 
complex e plane. Since analytical continuation is unique, this gives a unique result for each, in the region 
of applicability of the other, and the two can be combined before the limit e — ► is taken. This subtlety 
leads to some surprising results, for example for the self-energy of a massless quark, discussed below. 

As the calculation of cross sections in dimensional regularization is rather technical, it is rare to 
see it done in summer school lectures, but I think it brings out some interesting points, so I at least 
sketch how the calculation works in Section 13.1.21 As I said, those who disagree can safely skip ahead 
to Section l3~l~3l 

3.1.2 Aside: Real and virtual corrections in dimensional regularization 

It is straightforward to generalize the Feynman rules to d dimensions and fairly straightforward to gener- 
alize the Dirac algebra. The result is that (i-dimensional matrix elements still have propagators ~ 1/p 2 , 
but that the numerators become d dependent. (It is worth mentioning the closely-related dimensional 
reduction scheme, which is often used for supersymmetry calculations, since conventional dimensional 
regularization violates supersymmetry. In this scheme one works in d dimensions, but modifies the the- 
ory in such a way that fermions and massless vector bosons still have 2 spin states, instead of d — 2 
as in dimensional regularization. The result is that the matrix elements themselves are equal to the 4- 



29 



dimensional ones and it is only on performing the loop and phase space integrals that the d dimensionality 
gets introduced). 



Phase space integrals We will have to integrate over d-dimensional phase space. We begin by consid- 
ering integer values of d and then continue the results to real values. It is straightforward to write down 
the basic integration measure, 

d d k 5+{k 2 ) = ^— — = \uj d ~ z duj dfi d _ 2 , (3.7) 
2uj 2 

where u is the energy of k and d^^-2 is an element of d— 2-dimensional solid angle. The only difficulty 
concerns the evaluation of integrals over this solid angle. In four dimensions we have 

k = to (1; sin (j) sin 9, cos 4> sin 9, cos 9) (4 dimensions) , (3.8) 

where 9 and <fi are the usual spherical polar coordinates with 9 the polar angle and <f> the azimuthal angle. 
In five dimensions we have 

k = u(l;sinip sin<f> sin#,cosT/> sin</> sin(9,coS(/> sin#,cos#) (5 dimensions), (3.9) 

where tp is an azimuthal angle in the additional dimension. Generalizing to d dimensions, we have d — 4 
additional azimuths and we write k generically as 

k = u)(l; . . . ,cos(f) sin9,cos9) (d dimensions) , (3.10) 

where the ellipsis represents a d— 3- vector of length sin0 sin 9 containing d — 4 azimuths. Depending 
on the complexity of the calculation, more or less of these additional additional components have to be 
specified precisely. In fact in our case, since we only consider the relative orientations of three momenta 
that have zero total momentum, and therefore all lie in a plane, it is sufficient to specify 

k = . . . ,cos9) (d dimensions), (3.11) 

where the ellipsis represents a d— 2- vector of length sin 9 containing d — 3 azimuths. 

We can see how to integrate over the additional azimuths by again considering integer d and then 
generalizing, 



dfii = J 64 = 2vr, (3.12) 
dn 2 = J d<f) sin 9 d9 = 4ir, (3.13) 
J dn 3 = J dip sm(t>d(t> sin 2 9d9 = 2vr 2 , (3.14) 
and so on. We have a recursion relation 

Jdn n = j dn n -! sin n ~ 1 9d9, (3.15) 

which is solved by 

O n = dtt n = -r, r— ; — r . (3.16) 

J T[(n + l)/2] 

We are now equipped to tackle the phase space integral, and see how the dimensional regularization 
succeeds in regularizing our integrals. 
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Fig. 3.1: One-loop diagrams for e e 



qq. 



Regularization Since the form of the propagator factors is unchanged in d dimensions, and it is these 
that dominate the singular region, it is straightforward to read off the behaviour in the regularized theory. 
In the soft region we have 



1 



or 



dtu 



o v 



1 + 2<E 



1 

2? 



and in the collinear region 



8111 



l-2e , 



de 



9°- 



d9 

3l-2e 



1 

2? 



e < 0, 



e < 0. 



(3.17) 



(3.18) 



Since our cross section is divergent in both limits, and they can overlap, i.e. a radiated gluon can be both 
soft and collinear, we expect the total cross section to be of order 1/e 2 . Note, as a consistency check, 
that the integrands are positive definite and that, in the region in which they are well-defined, e < 0, the 
results are positive (and divergent as e — > 0). 



Total e+e - — > qqg cross section We now have all the ingredients we need to calculate the differen- 
tial cross section for e + e~ — > qqg and to integrate it over all phase space in dimensional regularization. 
We obtain 

o m = a C F ^H(e) (| + I + H + (ef) , (3.19) 

where erg is the lowest order cross section and H(e) is a smooth function, with H(0) = 1, that we will 
not ultimately need to know. Note that, as we anticipated from Eqs. (13. 1713.1 81) . this result is positive, 
and divergent like 1/e 2 as e — > 0. 

So far, the regularization scheme has succeeded in quantifying the degree of divergence of the total 
three-parton cross section, but it has not helped us solve the problem of the divergence, by recovering a 
finite result for a physical cross section. As we already anticipated above, this will come by calculating 
the loop correction to e + e~ — > qq. 



<x(e + e — — > qq) at one loop We already made the point that to calculate the total cross section for 
e + e~ — ► hadrons, we must sum over all e + e~ — > partons processes. At this order of perturbation theory 
qq is the only other process that contributes. There are three diagrams, shown in Fig. 13.11 They are down 
by one power of a s relative to the tree-level diagram, 

Mi oc a s M . (3.20) 

Therefore |A^i| 2 is two powers down and hence negligible at the order to which we are working. How- 
ever, since the final state is the same as that of the tree-level diagram, the two interfere, and their inter- 
ference, IRejA^gA^i} does contribute at order a s . 
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In quantum mechanics, you know that we must sum over all unobserved quantum numbers at the 
amplitude level. Since the gluon momentum is unconstrained by the outgoing quark momenta, we must 
sum over all gluon momenta, 

j d d k. (3.21) 

Note that there is no mass-shell-constraining delta-function: the virtual integral is over all arbitrary on- 
and off-shell momenta. 

We begin with the first two diagrams, which are proportional to the self-energy of a massless quark. 
It is actually easy to see that these have to be zero in dimensional regularization: the value of the integral 
has dimensions E d ~ A , but by Lorentz invariance the result of the integral can only be a function of the 
square of the quark's momentum, p 2 = 0, so there is nothing that can provide this dimensionality . 
The only way these two facts can be reconciled is if the integral is zero. However, if we examine the 
integrand somewhat closer, this is very surprising, because it is positive definite. How can a positive 
definite quantity integrate to zero? 

The answer to this question comes from a subtle use of dimensional regularization. In fact this 
integral is divergent in both the infrared and ultraviolet. If we split the integral into two parts by introduc- 
ing an arbitrary separation scale A, then we obtain an ultraviolet contribution ~ A~ 2e /e and an infrared 
contribution ~ —A~ 2e /e. Each is positive in its domain of applicability (e > and e < respectively), 
but after analytically continuing each to arbitrary e, they are exactly equal and opposite, giving a zero 
result for these diagrams. 

Turning to the third diagram, the vertex correction, we find that it is also divergent in the infrared 
and ultraviolet regions. However its ultraviolet divergence is exactly equal and opposite to the one from 
the sum of the two self-energy diagrams. Therefore the sum of the three diagrams is ultraviolet finite and 
no renormalization is needed at this order. This actually follows directly from the Ward identity of QED. 
Thus, one simply has to evaluate the vertex correction diagram in dimensional regularization, to obtain 
the complete order a s contribution to e + e~ — > qq. We find that the infrared divergences do not cancel, 
and we obtain 

= °* C ^ H ( e ) ("I " 7 " 8 + °( £ )) • 0-22) 

Dimensional regularization has succeeded in regularizing the divergence of this contribution as well. 
This time however, the result is negative and divergent as e — > 0. This should not surprise us, as we 
already noted that this is an interference term, so there is no requirement that it be positive, as there was 
for Tqqg. In fact a quick glance at Eqs. (13. 1913.22b shows us that the divergences are going to cancel 
between them. 



3.1.3 The total cross section 

In the previous section we discussed how dimensional regularization provides finite results for the total 
cross sections for the e + e~ — > qq and e + e~ — > qqg processes, which each diverge as e — ► 0. For the 
benefit of those who slept through it, I recap them here: 

|-^-8 + 0(e)V (3.23) 
| + 7 + f + (3-24) 

According to our earlier discussion, the total cross section for e + e~ — ► hadrons is given by the sum of 
the two. It is finite, so the limit e — ► can be taken, 

3 'in fact this statement relies on working in a covariant gauge. In a lightcone gauge for example, the self-energy can depend 
on n-p. This diagram is not then zero, but of course the final answer for the sum of the three diagrams is gauge invariant. 



<7qq = O-Q C F ^H(e) 



a 



qqg 



a C F ^-H(e)( 



32 



ao (l + C F ^- 
2vr 2 



Ofj H 

7T 



(3.25) 
(3.26) 



Of course this would be useless if it depended on the regularization procedure. The proof of its indepen- 
dence is beyond us here, but it is worth demonstrating it, by comparison with another scheme, the gluon 
mass regularization, in which we have 
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(3.27) 

(3.28) 
(3.29) 



Note that the individual cross sections have completely different forms in the different schemes, but that 
the sum of the two is scheme independent. 

Equation 13 .261 is one of the most fundamental quantities in QCD and is certainly one of the most 
well-calculated and measured. Despite the fact that it is a relative small correction to the total rate, 
experimental and theoretical systematic errors are so small that they can almost be neglected - even with 
the large statistics of r decays and Z decays at LEP, the statistical errors dominate. This means that 
not only does it provide one of the most accurate measurements, but its quoted accuracy is rather easy 
to interpret and implement in global analyses for example, unlike measurements that are dominated by 
systematics. 

Equation l3.26l is now known up to order af . As discussed in sectior fT~8l renormalization introduces 
a renormalization scale dependence into a s and the coefficient functions beyond the first one, 



♦hadrons 




(3.30) 



Reducing this renormalization-scale dependence is one of the biggest reasons for going to higher orders. 
As can be seen in Fig. 13.21 the scale-dependence is indeed significantly smaller at each order, giving 
stability over a wider range of \x. It can also be seen that provided \i is of order Q, the higher order 
corrections are relatively small. We will see shortly that simply taking the leading order result with 
fi = i/s does surprisingly well and is certainly sufficient to understand the phenomenology. 

3.1.4 a s measurements 

As I mentioned above, the experimental measurement of R e + e - gives one of the best measurements 
of a s . In fact the LEP combined value of Rhad is 



while the tree-level prediction is 



R(LEP) = 20.767 ±0.025, 



Rq(M z ) = 19.984. 



(3.31) 



(3.32) 



Combining the two, and simply using the leading order result with [i = M z , we obtain our first measure- 
ment of a s , 

a s (M z ) = 0.124 ± 0.004, (3.33) 



surprisingly close to the 'official' value using the three-loop result, 0.123 ± 0.004. 
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Fig. 3.2: The QCD prediction for the corrections to R e + e - at y/s = 33 GeV as a function of renormalization scale at leading, 
next-to-leading and next-to-next-to-leading order, taken from ESW| 1 1. 



As we discussed in Section [T.8.3l since QCD predicts the scale dependence of a s , one measure- 
ment at any scale is sufficient to give a prediction for all scales. We can therefore phrase measurements 
at other scales either as tests of QCD throughout the intervening energy range or, by translating them all 
into measurements at a single scale, as different measurements of the same quantity that can be combined 
to give a better overall measurement. 

As an example, the average measurement of R over several energy points around 34 GeV is 

R(PETRA) = 3.88 ± 0.03, (3.34) 

while the tree-level prediction is 

#0(34 GeV) = 3.69. (3.35) 
Again using the leading order result, we obtain 

a s (34 GeV) = 0.162 ± 0.026. (3.36) 

Finally, using the one-loop renormalization group equation, we can convert this into a measurement of 

a s (M z ), 

a s (M z ) = 0.134 ± 0.018. (3.37) 

This is in good agreement with the value from LEP, although with much larger uncertainties, simply due 
to the fact that the statistics of the PETRA experiments were much lower. 

As a final example, we consider r decays. The QCD corrections to the hadronic decay rate actually 
have two effects: on the ratio of branching fractions, R T , as discussed earlier, and also directly on the 
total decay rate of the t. These can form the basis for two analyses in which the experimental errors are 
largely independent. The combined result for the two is 

a s (M T = 1.77 GeV) = 0.33 ± 0.3. (3.38) 

This time, because we are translating over such a wide energy range the one-loop renormalization group 
equation does not do quite such a good job, 

a (one-loop)^ M ^ = Q 12g ± Q 0Q4 ^ (3 39) 
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Fig. 3.3: Results of a recent compilation of a s values |3|. 
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but it is not so far out. Note in this case the phenomenon of the "incredible shrinking error". Although the 
measurement at the r mass scale only has a precision of 10%, after evolving it to M z the relative uncer- 
tainty gets scaled down by the ratio of the two a s values, and r decays give one of the best measurements 

of a s (M z ). 

The results of a recent compilationO are shown in Fig. 13.31 The scale dependence shows excellent 
agreement with the predictions of perturbative QCD over a wide energy range. When translated into 
measurements of a 3 (M z ), the 33 separate measurements all manage to overlap with the world average 
value, 

a (world average)^ = q_-j_183 ± 0.0027. (3.41) 



3.2 Deep Inelastic Scattering Revisited 

The partem model I described in the last lecture assumed that the partons are non-interacting. But we 
know that they do interact via QCD, so what will happen when we consider these interactions? We 
will discover that the structure functions do become slowly (logarithmically) varying with Q 2 . We start 
by considering the next-to-leading order QCD corrections to quark scattering. We will find that these, 
if calculated naively, would be divergent, but that these divergences can be absorbed into the partem 
distribution functions. These will then become scale-dependent, giving rise to the (^-dependence of the 
structure functions. 



3.2.1 NLO corrections to DIS 

The next-to-leading order corrections come from three sources (recalling that we sum and integrate over 
all final states X, so we must sum over all contributions in which any kind of parton is scattered): 

1. One-loop corrections to eq — > eq, 
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2. eq — ► eqg, 

3. eg -> egg. 

The third contribution is completely new in QCD and is not present in the parton model. We come back 
to it in a later section. The other two can more genuinely be thought of as higher-order corrections to the 
parton model process. We start with the second. 

There are two contributing diagrams. The matrix element squared can be obtained by crossing 
from e + e~ — > qqg (11.66b . Labeling the momenta as 

e(k) + q(r] P ) e(fe') + q( Pl ) + g(p 2 ), (3.42) 

we obtain 



8C F N c eW 



k-k'pi-p2 r/p-p 2 



— ((Pi-k) 2 + (vp-k) 2 + (p r k') 2 + (r,p-k') 2 ) 



As usual the phase space is given by J2.24I) . 

dPS 



Q 2 



16tt 2 sx 2 

This time X consists of two partons so is non trivial, 

d ps x - dcosd 



dQ 2 dx dPS X - 
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(3.43) 



(3.44) 



(3.45) 



where 9 and (p refer to the direction of p\ in the centre-of-mass system of -qp + q. It is conventional to 
replace cos 6 by the manifestly Lorentz-invariant variable z, 



z = 



Pi-P 
q-p 



with range < z < 1, giving 



dPS x 



i(l -cos0), 



dz dcp 



16vr 2 ' 

It will later be instructive to know the transverse momentum of p\ in this frame, 



k i 



Q 2 (^-l)z(l-z). 



(3.46) 



(3.47) 



(3.48) 



Note also that the case r/ = x corresponds to a massless final state. Kinematically this can only happen 
if either p\ or p 2 are infinitely soft (i.e. have zero energy), or if they are exactly collinear. 

We therefore have 



da 2 (e + q) 1 1 Q 2 



dzdcj) 8C F N c e^e 2 g 2 



dxdQ 2 4N C 2s 16tt 2 sx 2 J 16tt 2 k-k' ' pi-piWVi 
Rewriting in terms of our kinematic variables and averaging over 4>, we have 
/ {Pi-k) 2 + {rip-k) 2 + {pi-k') 2 + (r]p-k') 2 \ 



( Pr k) 2 + ( VP -k) 2 + ( Pl -k') 2 + (vp-k 1 ) 2 ) . 

(3.49) 
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(3.50) 



where x v = x/rj. Two things are already clear: at this order we will have a non-zero longitudinal struc- 
ture function, Fl(x,Q 2 ); and the z integration, which runs from to 1, will give a divergent contribution 
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to i*2. This should worry us, since we are calculating a physical cross section, but let us continue for a 
while and see what happens. 

Putting everything together we have 



Cpa 2 e 2 a s 



da 2 (e + g 
dxdQ 2 2r]x 2 y 2 s 

and hence 
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(3.52) 

The divergence at z — > 1 coiTesponds to kinematic configurations in which the outgoing gluon becomes 
exactly collinear with the incoming quark. Therefore in the Feynman diagram in which the gluon is 
attached to the incoming quark, the internal quark line becomes on-shell, causing the divergence. Note 
also that the coefficient of the divergence itself diverges at the point x v = 1, at which the gluon is 
infinitely soft. 

In order to study the divergence, let us first regulate it by calculating the contribution from emission 
with k\ > ii 2 (and assume /i 2 <C Q 2 for simplicity). Since k\ is proportional to (1—z) this will give us 
finite integrals. At any time, the full result can be obtained by setting fi — > 0. We therefore obtain 

F 2 (x, Q 2 ) = W 1 dx p e 2 q —f q (—) ^ (p(x p ) log % + R(x p )) , (3.53) 

q Jx X p \ x p J ^ 7r \ A* / 

where the function R(x p ) is finite. In the following we will not keep track of this function, although it 
would be essential for quantitative analysis. The function P(x p ) we introduced in d3.53t is called the 
splitting function (or more strictly speaking the unregularized splitting function), 

P(x) = C F \±^. (3.54) 
1 — x 

It actually describes the probability distribution of quarks produced in a splitting process, q — ► qg in 
which the produced quark has a fraction x of the original quark's momentum. (We will quantify this 
statement slightly more shortly). 

Obviously by regulating the divergence we have not removed it: physical cross sections are still 
supposed to be obtained by setting /x — > 0, in which case F 2 is logarithmically divergent. However, 
before discussing what happens to this divergence, let us consider the virtual one-loop correction to 
eq — ► eq. Since this diagram contains two quark-gluon couplings, when squared it would give an 0(a 2 ) 
correction. However, since it has the same final state as the lowest order diagram we must consider the 
interference between the two, and this interference is 0(a s ), so we must include it. 

We could obtain the result for the one-loop diagram by crossing from e + e" — > qq. However, to 
illustrate the physics, it is sufficient to recall a few of its features. Firstly, since the external particles 
are the same as in the lowest-order process, the kinematics must be the same. In particular, it can only 
contribute at the point rj = x. Secondly, as in the e + e~ case, the interference of the one-loop and tree- 
level diagrams is divergent and negative. In fact the kinematic regions in which the one-loop integrand 
diverges are exactly the same as those of the eq — > eqg contribution we have just considered: when the 
gluon is soft, or is collinear with either of the quarks. 

It turns out that the divergence is exactly right to cancel the one we obtained above at x v — ► 1. In 
fact one finds that after including the one-loop contribution, one gets exactly the same formula as (I3.53t 
except that the unregularized splitting function P(x p ) is replaced by the regularized one, P(x p ), 



P{x) = P(x)+P virtual (x) 



(3.55) 
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Since the one-loop contribution has the same kinematics as the lowest-order process, Pvirtual^) must be 
proportional to 5(1 — x). P(x) is therefore a distribution. 

To define it, we will need to use a mathematical trick called the plus-distribution. Given some 
function f(x), which is well-defined for all < x < 1, we define a distribution f(x) + on the region 

< x < 1, as 

f(x) + = f(x)-5(l-x) f'dx'fix'). 

Jo 

The plus-distribution is most useful when the function f(x) is divergent at x 
any other function g(x), which is smooth at x = 1, we have the property 



(3.56) 

1. This means that for 



dx f{x) + g(x) 



dxf(x) (g(x)-g(l)). 



Provided that g(x) goes to g(l) sufficiently quickly, this integral is finite. 

After including the virtual contribution, the splitting function is given by 

r 2 

P{x) = C F ' 



1 + ^ , 3 

— + -5(l-x) 

{l-x) + 2 



(3.57) 



(3.58) 



This is actually the first correction to a function that describes the momentum distribution of quarks 
within quarks, 

V{x) =s(l- x ) + ?* log 9l P (x) + 0{a 2 s log 2 ), 
where the distribution is defined to be a pure quark at scale Qq and probed at scale Q. 



(3.59) 



Inserting the full splitting function into (I3.53t . we find that the divergence at x p — > 1 cancels 
between the real and virtual terms, but the divergence for x p < 1 due to the region z — > 1 still remains. 



3.2.2 Factorization of divergences 

To understand why the results are still divergent even after including the virtual terms, and what ulti- 
mately happens to the divergences, we consider their physical origin. Like the e + e~ annihilation case, 
we have singularities from regions in which the real gluon is collinear with either the incoming or out- 
going quark, or is soft, and also from the virtual graph, as illustrated in Fig. 13.41 All these contributions 
were present in e + e~ annihilation, but there we found that the divergences all cancelled to give a finite 
contribution. Why is the present situation different? In fact we find that here the magnitudes of the 
divergences are such as to cancel, but that the divergences arise in different regions of the x p integral, so 
are prevented from canceling. 

In the e + e~ case, we argued that the singular regions of real emission were indistinguishable from 
the lowest order process, since an infinitely soft gluon could not produce any hadrons and the jets pro- 
duced by two collinear partons were indistinguishable from a single jet with their combined momentum. 
This statement is true here for the soft and final state collinear contributions, but not the initial-state 
contribution. The final state of this contribution is indeed indistinguishable from the lowest order pro- 
cess (it has an additional jet collinear with the outgoing proton remnant, but this too gives a jet and 
the supeiposition of the two is indistinguishable from the proton remnant in the lowest order process). 
However, because we have used the parton model, we must convolute the partonic cross sections over an 
arbitrary (measured from experiment) probability distribution function, processes with different incom- 
ing momenta are effectively distinguishable. In all the singular regions, the final state of the process is 
massless, and this fact fixes the incoming momentum (to the value Q/2x in the Breit frame), but in the 
initial-state singular process it is the internal line whose momentum gets fixed, as indicated in Fig. 13.41 
Thus the incoming momentum in (c) is larger than in the other cases and its divergence, at i] > x, cannot 
cancel the others, at rj = x. 
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Fig. 3.4: Divergent contributions to DIS: (a) soft, (b) final state collinear, (c) initial state collinear and (d) virtual. The label p 
shows which momentum in each contribution is fixed by the massless final state condition. 

As I mentioned earlier, these divergences come from the virtuality of the internal particle van- 
ishing and hence the propagator diverging. Using the uncertainty principle, vanishingly small virtuality 
corresponds to arbitrarily long time-scales. This seems to be in direct contradiction with the assumption 
underlying the partem model, that the virtual photon takes an extremely rapid snapshot of the proton. 

The problem is actually one of overcounting. We first introduced the pdfs, which are supposed to 
contain all information about the internal structure of the proton. Presumably this internal structure is 
the result of QCD interactions. We then tried to calculate the QCD corrections to the quark scattering 
cross sections, integrating over all final states, so all energy-scales. But these QCD corrections should 
somehow already be included in the internal dynamics of the proton. 

To resolve this overcounting, we have to separate (or 'factorize') the different types of physics at 
different energy scales. Like in our discussion of renormalization, I will first try to give the physical 
picture in terms of a cutoff, before returning later to describe how factorization works in practice in 
dimensional regularization. We introduce the factorization scale \i, and call all physics at scales below \i 
part of the hadron wave function, and lump it into the parton distribution functions, and call all physics 
at scales above fi part of the partonic scattering cross section (or 'coefficient function'). 

Therefore we do in fact have a transverse momentum cutoff in the eq — > eqg process and the form 
of (I3-53I > is correct. 

Since physics at scales below \i is included in the pdfs and physics above is not, the pdfs them- 
selves must become ^-dependent. We therefore have 

F 2 (x, Q 2 ) = "£4 jf * dx p ^-f q Ll -x P ) + ^ \P{xp) log ^ + R(x p )j + , 

(3.60) 

where the function R(x p ) is not necessarily the same one as earlier, as the virtual contributions could 
have introduced some additional finite terms. 

Note that the structure functions are now Q 2 -dependent, violating Bjorken scaling. However they 
also appear to be [i 2 dependent, which should worry us: fj, was introduced in a completely ad hoc 
theoretical way: it simply separates physical processes into two parts that are dealt with in different 
ways, and the final result, which is the sum of the two parts, should not depend on where the separation 
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was made. We return to discuss this point in more detail after calculating the ^-dependence of the pdfs. 

It is important to emphasize that, although we have derived these formulae for the higher order 
corrections to DIS, the leading logarithmic behaviour is universal. In particular, for any quark-induced 
process with a hard scale Q, we expect a hadronic cross section of the form 

o-hiPh) = J2 J dr > fi (/?' j^m) + |^ lo S~~2" / dzP (. z ) a q(. Z7 lPh)^j , (3-61) 
where a g (p) is the partonic cross section for a quark of flavour q and momentum p. 



3.2.3 DGLAP evolution equation 

Although the pdfs are fundamentally non-perturbative and cannot be predicted from first principles at 
present, physics at scales close to p 2 can be described perturbatively. We can therefore calculate the p 2 - 
dependence of the pdfs so that, given their value at some starting scale pq, for example from experimental 
measurements, we can calculate their values at all higher scales p. 

To do this, we use the fact just noted, that physical cross sections should not depend on p 2 . There- 
fore we should have 

2 dF 2 (x,Q 2 ^ 
dp 2 

or at least, since we are working at (D(a s ), 



,, 2 dF 2 (x,Q 2 ) 2 

dp? = ' s '' (3 .63) 



Applying this to J3.60l >. we obtain 

d 




Equation (I3.64t is called the Dokshitzer-Gribov-Lipatov-Altarelli-Parisi (or DGLAP, or GLAP, or 
Altarelli-Parisi for short) evolution equation. Note that the rate of change of the pdf at some x value 
depends on its value at all higher xs. 

To understand its physical content, it is useful to rewrite the splitting function, 



P(x) = C F 



1 + x 2 3 



(1 



+ o*( 1 



1 + X 2 



Cf\— , (3.65) 



to give 

2 d ( 2 x a s f l dx p ( x 2 \ 1 + x 2 a s , 2 x f 1 1 + x 2 

*di? f *( X > ft ) =CF toL I^ fq {- P ^ )— p - CF 2n- f it X ^)] dXp — p (3 " 66) 

The first term represents the fact that the pdf at a given x value is increased by quarks with higher 
xs reducing their momentum fractions by radiating gluons. The second term represents the fact that 
it is decreased by the quarks at that x reducing their momentum fractions by radiating gluons. Each 
contribution is divergent due to emission with x p — ► 1, i.e. infinitely soft gluon emission, involving an 
infinitely small change in x. However the two divergences exactly cancel because the number of quarks 
being lost to this x value by infinitely soft gluon emission is equal to the number being gained. 

The DGLAP equation is most easily solved in moment space. For any function f(x), we define 

f N = f 1 dxx^ 1 f(x), (3.67) 
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the Mellin transform. Taking moments of both sides of (13.641) . we obtain 
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It is common to introduce the notation 
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(3.68) 
(3.69) 

(3.70) 



where 7^ is known as the anomalous dimension. If we assume that the coupling a s is fixed, we can 
easily solve ( 13.69ft with the boundary condition of given values for f q w at some fixed scale fio, 

2 \ In (as) 



fqNiv ) = /gArOo) 



(3.71) 



However, as we have seen, the renormalization of QCD means that the coupling constant becomes 
scale dependent, a s (fi 2 ), according to renormalization group equation 



(3.72) 



Inserting the solution of the running coupling, Eq. (11.73b . into (I3.69t . we obtain 



fqNi/J 2 ) = fqN(l4) 



(3.73) 



Having the solution for f q in moment A^-space, we have to convert it back to x-space. This is done 
by the Inverse Mellin Transform, where f q N is continued to the complex plane, 



dN f qN x 



-N 



(3.74) 



1 

27ri Jc 

where the contour C runs parallel to the imaginary axis to the right of all poles. Because of the complexity 
of this process, the DGLAP equation is often solved simply by 'brute force' numerical solution of d3.64l i. 



Beyond 0{a s ) the general structure of (13.691) and (13.721) remain unchanged: the anomalous di- 
mension and (3 function simply become power series in a s . 



3.2.4 Scheme/scale dependence 

Factorization, as introduced above, may seem pretty arbitrary. However it can be proved to all orders in 
perturbation theory. The most convenient way to do this is to use, instead of the transverse momentum 
cutoff we used above, dimensional regularization. When we calculate the NLO cross section in d dimen- 
sions, the divergence shows up as a pole, 1/e. The coefficient multiplying this pole turns out to be the 
same splitting function we encountered earlier. 

In d dimensions, we obtain for the structure function up to 0(a s ), 

f 2 (x, q 2 ) = J24 [ dx p~Ji {£) j^ 1 - x p) + £ ( (^jr) + R(Xp) ) + ° {e) } ' 

(3.75) 

where \i is the scale introduced to make the coupling constant dimensionless. Note that I have sneakily 
added a bar to f q and that it is scale independent. f q is known as the bare pdf. We now note that the 
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distribution functions themselves are not physical observables, only their convolution with coefficient 
functions. I can therefore define a modified set of distribution functions as follows: 

xf q (x) = £dx p ^/,^,^|*(l-xp)-g^^j ^P(x p )+K(x p )^Y (3.76) 

where fip is the (completely arbitrary again) factorization scale, and K(x p ) is a completely arbitrary 
finite function to be discussed shortly. (To fit in with the standard notation, I should really multiply all 
occurrences of a s by 1/T(1 — e) = 1 — 7^e + 0(e 2 ), but this will merely change the values of R(x p ) 
and K(xp) which I do not specify anyway). Combining (13.75b and (I3.76I) . we end up with 

F 2 (x, Q 2 ) = £ e 2 f 1 dx p -f q (-,» 2 F ) {6(1 -x p ) + ^ (p(x p ) log % + R(x p ) - K(x p )) + 0(a 2 s ) 

q J x X P \ X P J { /7T \ ) 

(3.77) 

Note that this has the identical form to <3.60t . except for the finite function. It is clear from d3.76t that 
fq(x,fj,p) depends on the function K(x p ). It therefore seems like we have no predictive power: the 
pdf and coefficient function each depend on the completely arbitrary function K(x p ) and the completely 
arbitrary scale [i p (note that all dependence on \i has again completely cancelled. As I said in the context 
of renormalization, many textbooks simply set it equal fj, right from the start, but I consider this slightly 
confusing as they have quite different physical meaning. Having performed this manoeuvre, I henceforth 
drop the subscript p). However the factorization theorem proves, firstly that for any physical quantity, 
all dependence on K(x p ) and \i will cancel and secondly that the scheme- and scale-dependent pdfs, 
f q {x, /x 2 ) are universal (i.e. process-independent). 

Two schemes are in common use, the MS scheme in which K(x p ) is zero, and the DIS scheme in 
which K(x p ) = R(x p ), i.e. in which for [i = Q the partem model result is exact. 

To understand the physical content of the scheme-dependence, it is worthwhile going back to the 
case with a cutoff. If, instead of a cut on transverse momentum we had used a cut on the virtuality of the 
internal quark line to separate the pdf from the coefficient function, we would have got exactly the same 
form as (13.60b except that R(x p ) would have been a different function. In particular, it would differ by a 
log[(l — Xp)/x p ] term, together with some non-logarithmic terms. In fact, all logarithmic terms turn out 
to be the same with a p t cutoff as in the MS scheme, so for many puiposes the two can be considered 
equivalent. 

Although dependence on the scheme and scale must cancel in physical quantities, it is only guar- 
anteed to do so after calculating to infinite orders of perturbation theory. At any finite order there can be 
some residual dependence. We must therefore have a procedure for choosing a value of fi. Essentially 
the identical discussion we had for the renormalization scale choice applies here. One can show that a 
structure like (13.60b continues to all orders of perturbation theory and that for every power of a s , one 
gets a power of log Q 2 / fi 2 . Thus every order of perturbation theory contains terms like a" log" 1 Q 2 / /j, 2 , 
m < n. It is clear that if /i is a long way from Q, the log can be large enough to compensate the smallness 
of a s and the perturbative series will not converge quickly. One should therefore choose \i 'not too far' 
from Q. 

It is worth mentioning that one can set up DGLAP evolution equations for the Q 2 -dependence 
of the structure functions, F 2 and Fl, themselves. These are then automatically scheme- and scale- 
independent even at finite orders of perturbation theory. This is sometimes known as the scheme- 
independent scheme. 

3.2.5 Initial state gluons 

As mentioned right at the start of this section, we also obtain 0(a s ) corrections from the process eg — ► 
eqq. Most of what we said above carries over in a straightforward way. Although there is no soft 
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P qq {x) = C F 



1 + , 3 

0^ + 2 5(1 - x) 



P gff (x) = Tr 



x 2 + (i-x) s 



^l + (l-x) 2 




P g Jx) = c F 



P gg (x) = C A 



2x 1 — x 

7- r- + 2 + 2x1 - X 

(1 — X)+ X 



+PoS(l - x) 

Fig. 3.5: The four DGLAP splitting functions of QCD. 

singularity or virtual term to cancel it, there is a collinear singularity. This corresponds to a two-step 
process in which a gluon splits to a q-q pair, one of which interacts with the photon. The singularity 
again corresponds to the virtuality of the internal quark line going to zero. This singularity can again be 
absorbed into a factorized universal pdf for the gluon. We end up with an additional contribution to the 
structure function of 

F 2 (x, Q 2 ) = E 4 £ dx P^~/s Mr, fA | J log ^ + Rg(x p ) - K qg (x p )\ + 0(a 2 s )^ , 

(3.78) 

where the sum over q is over all 'light' flavours. We now have four different types of splitting function, 
illustrated in Fig. 13.51 The DGLAP equation now becomes a set of coupled equations: 



A* 



d 
dfi- 



r/« 0- A* 2 ) = E ? r —fb (-, fA Pa b (x P ) + 0{al). 



(3.79) 



In moment space, this can be conveniently written as a matrix equation (in general of (2Nf+l) x (2i\fy+l) 
matrices, but for simplicity we show the case of only one flavour of quark): 



fqN \ ( lqqN{at S {v)) J qgN (a s {(i) ) 

I fa* = lqqN{a S (v)) JqgN{a S (v)) 

fgN I \ 7gqN{a S (n)) JgqN(a S {n)) 7ggN {ol s (//)) 



d 



dji 




(3.80) 



Exactly the same solution is obtained, but in matrix notation, 





/ fqN(H 2 ) 
fqN(V 2 ) 

V fgN^ 2 ) 



exp 



M 2 d /2 / lqqN{a S (n')) 

j qqN {a s (fJ,')) j qgN (a s {fi')) 



,/2 



7g 3 Af(a s (At')) 

7qgN(a S (fJ.')) 

\ j g q N {a s {fJ-')) ~/ g g N (a s {fi')) 7otJv(os(a*0) 



./V(Mo) 

fqN^l) 

fgN&o) 



(3.81) 

This is even more troublesome to do by the Inverse Mellin Transform, so the full set of DGLAP equations 
is almost always solved numerically. 



Note that at higher orders of perturbation theory, even the zero entries in d3.80t become non-zero, 



as do contributions like P qq /(x) 



3.2.6 Violation ofBjorken scaling 

As we already noted, the factorization of initial state singularities introduces a logarithmic Q 2 depen- 
dence into the structure functions and therefore a slow violation of Bjorken scaling. There is a close 
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Fig. 3.6: Fit to the F2 data over a wide range of Q 2 values, exhibiting violation of Bjorken scaling. 

analogy with the renormalization of one-scale cross sections, where the energy-dependence was entirely 
due to the quantum corrections. Although the pdfs at some low scale are entirely non-perturbative and 
must be fit to data, the scale-dependence is entirely predicted by QCD and provides a stringent test over 
a wide range of energy scales. The result is impressive, see Fig. 13.61 

3.3 Summary 

NLO calculations are hard! This is mainly because the real and virtual corrections are each divergent 
and must be regularized in some self-consistent way, for example with dimensional regularization. Un- 
like the ultraviolet divergences, which are isolated in well-localized pieces of the loop calculation and 
can effectively be removed by a redefinition of the Feynman rules, these divergences arise in different 
partonic contributions to physical observables. They must therefore be kept explicit until the very end of 
the calculation when all the partonic contributions are combined. Only then, provided our observable is 
infrared safe, will the real and virtual divergences cancel to yield a finite result. 

Processes with incoming partons have extra divergences, arising from a miscancellation of the 
initial-state-collinear real and virtual contributions, which appear at different points in the integral over 
incoming momentum fraction. (It is worth mentioning that the same argument applies to the final state 
distributions of identified hadrons, for example the momentum distribution of pions produced in e + e~ 
annihilation). These divergences have to be factorized into the non-perturbative, but universal, parton 
distribution functions at some factorization scale fip. This extra scale in the structure functions allows 
them to be Q 2 -dependent. This Q 2 -dependence is entirely driven by the ^-dependence of the parton 
distribution functions, which is predicted by the DGLAP evolution equations. Thus structure function 
data over a wide range of Q 2 provide a stringent test of perturbative QCD. 
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4 Summary 

In this short course on QCD phenomenology, I have resisted the temptation to review the many important 
tests and studies of QCD that have been made over the years and have instead tried to concentrate on the 
key ideas that underpin them. These are: 

• The gauge invariance of the theory, which allows us to write down the Lagrangian and which 
predicts some of the most important features of the theory: the universality of the coupling constant 
and the self-coupling of quarks, which ultimately leads to the negative (3 function and hence to 
asymptotic freedom at high energies and strong interactions at low energies. 

• Renormalization and decoupling, which allow us to make predictive calculations at finite en- 
ergy, without knowing the full structure of the theory to arbitrarily high energy and without the 
introduction of arbitrarily many input parameters. Renormalization is related to the quantum 
structure of the theory and introduces a dimensionful scale into even the scaleless Lagrangian of 
massless QCD, giving rise to energy-dependence of one-scale observables that would be energy- 
independent in the classical theory. 

• Factorization and evolution, which allow us to use perturbation theory to calculate the interactions 
of hadrons, since all the non-perturbative physics gets factorized into universal functions that can 
be measured in one process, like DIS, and then used to predict the cross sections for any other 
process. Again, this introduces a scale dependence into the partem model so that the structure 
functions of DIS, and other one-scale observables such as the Drell-Yan cross section, become 
scale dependent. 

• Infrared safety, which ensure that the infrared singularities associated with soft and collinear emis- 
sion cancel between real and virtual contributions, allowing the perturbative calculation of jet cross 
sections, without a detailed understanding of the mechanism by which partons become jets. 

Together, these allow us to make sense of QCD, without having to solve the theory at all possible scales: 
unknown or uncalculable high- and low-energy effects can be renormalized, factorized and cancelled 
away. After all this, it is remarkable that most QCD phenomenology can be understood at least qual- 
itatively from leading order perturbation theory with the one-loop renormalization group and DGLAP 
evolution equations. Higher order corrections, while essential for quantitative analysis, do not change 
this simple picture dramatically. 

Acknowledgements 

It is a pleasure to acknowledge the organizers, lecturers, tutors and students of the European School of 
High Energy Physics in St. Feliu for making giving these lectures such an enjoyable experience. 

References 

[1] R.K. Ellis, W.J. Stirling and B.R. Webber, QCD and Collider Physics, Cambridge Monographs on 
Particle Physics, Nuclear Physics and Cosmology, volume 8, Cambridge University Press, 1996. 

[2] M.E. Peskin and D.V. Schroeder, An Introduction to Quantum Field Theory, (Addison-Wesley, 
1995). 

[3] S. Bethke, a s 2002, talk presented at QCD 02 High-Energy Physics International Conference on 
Quantum Chromodynamics , Montpellier (France) July 2-9, 2002, hep-ph/02 11012 



45 



